Gene Cag_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1014 
Symbol 
ID3746742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1356053 
End bp1357996 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content41% 
IMG OID637773543 
ProductTonB-dependent receptor-related protein 
Protein accessionYP_379319 
Protein GI78188981 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA AAGTATTTGT GGTTGTCATG GCGGGTTTGC TATGCAGCAA GGGGCTTATG 
GCGGCTGATG GGCAGCCGAT GAGCGTTATG GGTGAGGAGA TGGTGGTTAC TTCAAGCCGT
TTTGAGGAGC CAAAGAAAAA TCTCACATCG AACATTACCA TTATCGGTAA AGATGAGATT
GCTCAATCGT CGGCTCAAGA TTTGGGGGAA TTGTTGGCTG AAAAAAATCT TGGGCAAGTT
CAGAAATATC CCGGCACGAT GACTTCGGTT GGTATTCGAG GTTTTAGAAG TGAAACGCAT
GGCAACGATT TAATGGGTAA AGTTTTAGTG CTTATTGATG GTCGGCGTGC AGGTACGGGC
AATACGGCTA AAATTATGAC CGAGAATGTT GAGCGTATTG AAATTATTCA AGGTCCAGCG
GCAGTGCAAT ATGGTTCAGC CGCAATAGGT GGAGTAGTAA ATGTTATTAC TAAAAAAGGC
GATAATGTAC CTTCATTTTT TGTAGAACAA AAAGGGGGAA GCAATGAATT TGTGAAAACG
GCTGCCGGTG TGCAGGGCAA AATTGGTAAG CTTGATTTTG CAAGTGCCCT TTCGCGCTCT
GAAGCTGGTG ATTATAAAAC GGGTTCTGGT AAAACATATT TTAATACGGG ATATGATGAG
CAGGTTTTAG CTAATCTTAA TGTTGGCTAT GAAATTGCTG ATGGGCATCG TATCGGGTTT
GGATATCATT CATTTGATGT AGATAAAGCT GGATCGCCAT CATATCTTAG CCTGAATGAC
TTACAAAGTT ATACAAGCAA TAACAATCAT GCTATTGATG TTAGCTATGA AGGTGCTCTT
ACCAATAAAC GTTGGTTATG GTCAACTCGC TATTTTAGTG GTATGGATAG CTATCAATAT
GTTGATCCAT CAACTTCTTA TACAAGTTCG AGTGATGTGG AACAGCAAGG TGCTCAAGCG
CAAATTTCAT TTACCGAAAA GAGCTTGCTT ATAACGGCTG GAGTTGATTG GCTAAATTAT
GAATTAACCT CAACGTTAGC CCCTAAGTGG AGTAAATACA ATAACCCAGC CGTTTTTTTG
CTTGCTAAAT ATGGAGTGTT TGAGGATCGG CTTTTGCTTT CGGCTGGTGT TCGCTATGAT
GATTATAAGG TTGATTTACA ACCTGCGGAA GGCACCTCGC GTAGTACGGA TAATTTTGCC
CATCAAGTAG GAGCTGCATG GCAAGTGAAT GATGTTGTAA AACTACGCGC TTCTTATGCC
GAAGGTTTTA GAATGCCATC GGCACGTGAA CTTGCTGGAA ATATTGTGTC GTTTGGTAAA
ACTTATATTG GAAATCCTAA TTTAAACCCT GAAGTTAGCG AAACATGGGA AACAGGTATT
GATGTTGTAT GGAGAGAGAT TACATCATCA CTTACATGGT TTTCAACCGA TTACACCGAT
ATGATTGAAA CCCAGCTCAC TGCTCCTAAA ACATATCTCT ATAAAAATAT TGGTAGTACT
TCGTTGTCGG GTATTGAAGC TGAATTTGCA TGGAAAAGTT CTGCAACTAC TTGGAACATT
GAGCCTTACG TAAACTACAG CTATCTCTTA GAGCATAAAG ATAATGCTAC GGGCGATGAT
TTGCTTTATA CCCCTGAGTG GAATGCCAGC ACAGGAGTGC GTCTTCAGCA TACGAATGGC
TTAAGTGCTG CACTGAATGT TACCGCAACG GGTAGTAGTA ATGTGCAAGA TTATGAAAGC
AATTCAGGTA AAGTGATTAC GAAGGGTGGC TTTAGTGTTG TAAATCTTTC AGCATCGAAA
AAATTTACGC TTGATAAACA AGAGCGTCGT GCTATTACTA TAAAAGCTGA GGTTGATAAT
TTGCTTGATC GCGATTATCA ATACGTTAAA GGTTATCCCA TGCCCGGACG CACGTTTGTG
ATTGGGTTGC GTGCCGACAT CTAA
 
Protein sequence
MNKKVFVVVM AGLLCSKGLM AADGQPMSVM GEEMVVTSSR FEEPKKNLTS NITIIGKDEI 
AQSSAQDLGE LLAEKNLGQV QKYPGTMTSV GIRGFRSETH GNDLMGKVLV LIDGRRAGTG
NTAKIMTENV ERIEIIQGPA AVQYGSAAIG GVVNVITKKG DNVPSFFVEQ KGGSNEFVKT
AAGVQGKIGK LDFASALSRS EAGDYKTGSG KTYFNTGYDE QVLANLNVGY EIADGHRIGF
GYHSFDVDKA GSPSYLSLND LQSYTSNNNH AIDVSYEGAL TNKRWLWSTR YFSGMDSYQY
VDPSTSYTSS SDVEQQGAQA QISFTEKSLL ITAGVDWLNY ELTSTLAPKW SKYNNPAVFL
LAKYGVFEDR LLLSAGVRYD DYKVDLQPAE GTSRSTDNFA HQVGAAWQVN DVVKLRASYA
EGFRMPSARE LAGNIVSFGK TYIGNPNLNP EVSETWETGI DVVWREITSS LTWFSTDYTD
MIETQLTAPK TYLYKNIGST SLSGIEAEFA WKSSATTWNI EPYVNYSYLL EHKDNATGDD
LLYTPEWNAS TGVRLQHTNG LSAALNVTAT GSSNVQDYES NSGKVITKGG FSVVNLSASK
KFTLDKQERR AITIKAEVDN LLDRDYQYVK GYPMPGRTFV IGLRADI