Gene Cag_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0104 
Symbol 
ID3747592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp118404 
End bp119819 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content49% 
IMG OID637772630 
ProductElongator protein 3/MiaB/NifB 
Protein accessionYP_378425 
Protein GI78188087 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.016324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAAC ATGTAAAAAA AGTGGCGTTA GTTTTTCTCC CCTCAGAAAG CGGTGTTGAC 
GGCGCTCGTT CACTTTATGC AAACAAAGCG GCACAGCACC CGCTTAAAGA GTGGGCAAAC
AACGCGTTTC GTGGCATTAT TAAACGAAGC CAATTTGCCA TTCCCCCGCT TTCGTTGATG
ATTCTTAGCT CACTTGAAGT AGCAGGTGTC CAGCAAGTGA TCTGCGATCT CCGCTTTGAG
GATTTTGATT TTGAAATGAA GTGGGATTTA GTGGGCATTA GCGTCCAAAG TGGCATGGCT
CGCAAAGCCT TTGAGCTTGC TGATGCGCTT CGTGCGAAAG GAATAAAAGT AGCGCTTGGC
GGTGCCCATG TAACCCTTTT CCCTGAAAGC TGCCAACCTC ATGCGGATGT TTTGGTGCCC
GGTGAAGCCG ATGAAGTGTG GGAAGAGGCG TTGCGCGATC TTGTAGCGAA TAGGCTCCAA
CCACTTTACC GTGCCGAAAG CTTCCCAAAC CTTCAGCACG CCCGTCCCGT TAGCAAACAA
GCGTTACAAC CCGAACGCTA CTTTACCACC AATTTGATAC AAACAGGACG AGGCTGCCCA
TACAACTGCG ACTTTTGCAA TGTTCACGTC TTGAATGGGC ACACCTTGCG CCAACGCCGT
ATTACCGATG TAGTGCAAGA AGTTGCTCGC TTTCAACAAG ATGATCAGCG CATTTTCTTT
TTTGTTGACG ACTCCATCAA TGCTGATCCC GCTTACGCCT TAGAGCTTTT TCAATCCCTG
ACTCCCCTTA AAATCCGTTG GTTTGGGCAA GCCACTACCA CCTTAGGGCA GCAGCACGAA
CTCCTTAGCG CCTTTGCCGA CTCAGGCTGC CAAGCATTAT TGGTTGGCAT TGAAAGTATT
GAGAACGCCA GCCGCACAGC CCACGCTAAA CAGCAAAACC GTGCAAACGA GTTAGTGAGC
GCCATAACCA CCATTCGCCA AGCAGGCATT AGCCTTTACG GCAGCTTTAT TTATGGACTT
GATGGCGACA CCCTCGAAAC ACCCGCTGCA ATTTTAGATT TTGTAGCACA AACAAAACTT
GATGTACCCG GCATTAACAT TTTACGCCCA ACCCCAGGCA CCCGCGTTTT TGAACGCCTC
CGCAACGAAG GACGCTTACT GTTTGACCCA AATGATGTAA CAGCATACCG CTACTCTTTT
GGACAAGAAA TGCTCTATCG CCCAAAAAAC ATTCCACTTG ACGACTTTAT TGAAAGCTAT
AGCCAACTAA CACGCACTCT TTTTTCATGG CAAAACGCCG TTAAACGAGG ATTAAACGCC
CCACGAGCAA AAAGCGCCGT CCTGCTTTTT AACCTCTTCT ATAGCCACCT TTACACCCTC
TCGCGCAACG ACCTGCAAGC ACAAAAACTA TCGTAA
 
Protein sequence
MAEHVKKVAL VFLPSESGVD GARSLYANKA AQHPLKEWAN NAFRGIIKRS QFAIPPLSLM 
ILSSLEVAGV QQVICDLRFE DFDFEMKWDL VGISVQSGMA RKAFELADAL RAKGIKVALG
GAHVTLFPES CQPHADVLVP GEADEVWEEA LRDLVANRLQ PLYRAESFPN LQHARPVSKQ
ALQPERYFTT NLIQTGRGCP YNCDFCNVHV LNGHTLRQRR ITDVVQEVAR FQQDDQRIFF
FVDDSINADP AYALELFQSL TPLKIRWFGQ ATTTLGQQHE LLSAFADSGC QALLVGIESI
ENASRTAHAK QQNRANELVS AITTIRQAGI SLYGSFIYGL DGDTLETPAA ILDFVAQTKL
DVPGINILRP TPGTRVFERL RNEGRLLFDP NDVTAYRYSF GQEMLYRPKN IPLDDFIESY
SQLTRTLFSW QNAVKRGLNA PRAKSAVLLF NLFYSHLYTL SRNDLQAQKL S