Gene Noca_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2021 
Symbol 
ID4598643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2165981 
End bp2167549 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content75% 
IMG OID639776625 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_923218 
Protein GI119716253 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0398286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGCTGC CGCTGCCCGG GACGGAGCCG GCAGCCCGCA CGCACCCGCA GCGTCCCGAG 
CCCGCTCCGC CATTCCTCCT GGCGCTGGGC GCGCTGGTCA CCGTCGCCTG CCTGATCCCG
CTCGGGTACG TCGTGTGGTC GGTCGCCGAC GTCGGCCTGG CCGAGGCCCG CGACTACCTG
TTCCGGCCCC GGATCGGCGA GCTGCTGTGG AACACCTCCC GGCTGCTGGT CGGCGGCGTG
GCCCTCAGCG TCGTGCTCGG CGTCGGGGGT GCCTGGCTGG TCGAGCGCAC CGACGTCCCC
GGCCGGGGCT GGTGGCACGG CTTGATGTGC GCGCCGCTCG CCGTACCCGC CTTCGTCAAC
GGCTACGGCT GGGTCTCGAT GACGCACGCG GTGCAGAGCT ACGGCGGCGC CGCGCTCGTC
GTCAGCCTGT CGTACTTCCC GTTCGTCTAC CTGCCCACCG TCGCGGCCCT GCGCCGCCTC
GACCCGAGCC TGGAGGAGGT CAGCGCCTCC CTGGGACAGC GGCCGCTCGC GACGTTCCTG
CGGGTGGTGC TGCCCGCGAT CAGCCCCGCC GTGCTCGGCG GCGCGCTGCT GGTCGGCCTG
CACCTGCTGG CGGAGTACGG CGCCCTGCAG CTGCTCAACT ACCCCACCCT GACCACGGCG
ATCCTGCAGC AGTACGGCAC CTCCTTCAAC GGCCCGGCCG CCAGCCTGCT CGCGCTCGTC
CTCGTCGTGT TCTGCCTGGC GCTGCTGGCC GTCGAGCTGC TGCTGCGTGG CCGCGGCCGC
CGGGCCCGGG TCGGCTCGGG CGCCGCCCGC GCCGCCGACC CGCACCGCCT CGGCCGCGGG
CGGCTCCCGG CGGCGGGTGG GCTCGCCGCG CTCGTCGTAC TCGCCCTCGG CGTGCCGCTG
CTCAACCTGG CCCGGTGGCT GGTGCGCGGC TCCTCGACCC GGTGGGACCT GCCCGACCTG
ACCTCGGCCA TCGCGACCTC CGTCGGCCTC GCCGTGCTCG CCGGTCTGGT CGCCACGGCC
GCGGCGACAC CGGTCGCGTG GCTCTCGGTG CGGCACCGCG GCGGGCTGAC CACGACGCTG
GAGCGGGCCA CCTACACCGC CAGCGCGATG CCCGGCATCG TGGTCGCGCT CGCCCTGGTC
ACGGTGTCGA TCCGTGCCGT CCCCGCGCTC TACCAGACCG TGCCGCTGCT GGTGATCGGC
TACGTCATCC TGTTCCTGCC GCGCGCGGTC GTGAGCCTCC GGCCGACCAT GGAGCTCGCG
CCGCCGGTGC TCGAGGACGT GGCGCGCTCG CTGGGCTGCG GCCGCACCGG CGTCGGCCTC
CGGGTGACCG CGCCACTCAT CGCGCCCGGC CTCGCGGCCG GCTTCGCGCT CGTGTCCCTG
GCCGTGTCGA CCGAGCTGAC CGCCACTCTG CTGCTCGCCC CGATCGGCAC CGACACGCTC
TCCACGGAGT TCTGGTCCAA GGCCTCCTCC GTCGCGTACG GCGCGGCCGC GCCGTACGCC
CTGGCGCTCG TGGTGCTCTC GGTCCCGGCG ACCTGGCTGC TCAGCCGAGT CACGGTGGGT
GCGCGATGA
 
Protein sequence
MQLPLPGTEP AARTHPQRPE PAPPFLLALG ALVTVACLIP LGYVVWSVAD VGLAEARDYL 
FRPRIGELLW NTSRLLVGGV ALSVVLGVGG AWLVERTDVP GRGWWHGLMC APLAVPAFVN
GYGWVSMTHA VQSYGGAALV VSLSYFPFVY LPTVAALRRL DPSLEEVSAS LGQRPLATFL
RVVLPAISPA VLGGALLVGL HLLAEYGALQ LLNYPTLTTA ILQQYGTSFN GPAASLLALV
LVVFCLALLA VELLLRGRGR RARVGSGAAR AADPHRLGRG RLPAAGGLAA LVVLALGVPL
LNLARWLVRG SSTRWDLPDL TSAIATSVGL AVLAGLVATA AATPVAWLSV RHRGGLTTTL
ERATYTASAM PGIVVALALV TVSIRAVPAL YQTVPLLVIG YVILFLPRAV VSLRPTMELA
PPVLEDVARS LGCGRTGVGL RVTAPLIAPG LAAGFALVSL AVSTELTATL LLAPIGTDTL
STEFWSKASS VAYGAAAPYA LALVVLSVPA TWLLSRVTVG AR