Gene Cphamn1_0937 details


Gene Information

Locus tag: Cphamn1_0937
Symbol:
ID: 6374604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1014004
End bp: 1015122
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 50%
IMG OID: 642683439
Product: chlorophyllide reductase iron protein subunit X
Protein accession: YP_001959363
Protein GI: 189499893
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1348] Nitrogenase subunit NifH (ATPase)
TIGRFAM ID: [TIGR02016] chlorophyllide reductase iron protein subunit X
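
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1014004
END_BP = 1015122
GENE_LENGTH_BP = 1119
PROTEIN_LENGTH_AA = 372


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)

print(computed_gene_len == GENE_LENGTH_BP)        # consistency of coordinates
print(computed_protein_len == PROTEIN_LENGTH_AA)  # consistency of protein length
```

Under these assumptions the reported gene length and protein length agree with the coordinates.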


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAC CTCGCACAAT TGCCATTTAT GGAAAAGGCG GTATAGGAAA AAGCTTCACG 
ACAACCAACC TCAGCGCTAC ATTCGCGCTT ATGGGAAAAA GAGTGCTACA GCTTGGCTGT
GACCCTAAAC ACGATTCAAC AACGTCGCTT TTTGGCGGCG TCTCGCTGCC GACCGTCACA
GAAGTGTTCG CAGAAAAAAA TGCCCGCAAC GAAGAGCTGC AAATCAGCGA CATCGTCTTC
AGGAGAGATA TACCGGATTT TCCGCAACCG ATCTATGGCG TTGAACTCGG CGGGCCGCAG
GTTGGAAGAG GGTGCGGAGG CAGAGGCATT ATATCCGGTT TTGACGTACT GGAAAAATTA
GGCCTGTTTT CCTGGGACCT GGATGTCATT CTGATGGATT TCCTTGGAGA CGTCGTCTGC
GGAGGTTTCG CAACGCCTCT TGCCCGCTCC CTGAGTGAGG AAGTGATACT GCTTACAAAC
AACGACCGTC AGTCGATTTT CACCGCAAAC AATATCTGCC AGGCAAACAA CTACTTCAAG
ACCGTCGGTG GAGAGTCAAA ACTGCTCGGG CTCATTATCA ACCGGGACGA CGGTAGCGGC
ATTGCAGAAA AGTACGCTGC TGAGGCAGGA ATCACTATCC TCATGAAACT GCCGCATAAC
ACCGCGGCGA GAGACAAGGA CGACAGTTTC GATTTTGCCG TCCGTCTTCC TGAAATCGGA
GAACCGTTCC GCAAACTCGC CACTGATATT CTTGAAAGAA AAATAACACC CTGTGAAGCT
GCAGGACTTG ATTTTCAGAC ATTTATCCGC CTTTTCGGAG AGGTAAACGA AGCTCACCCG
ACCCCGGCGT CCCAAGATGA ATTAACCGGT CAAAAACAAC AGATCAACGG CGAGAGGCCC
GAAGCGGCAC AAAACGATTC AGTTTCACCT GAAAGCGAAA AACTGTTTGC CTGCATTGAA
AAACTCCCTG ATTCCGAAAA GGAAATCTAC CGCTTGATCG AGGTGGAGAA AAAAAGCGCT
GCGGAAGCAG CCGGAATAAA GGGGATCAGC GAAGCAGAGG CACAGGAAAT TTTTTCTTCA
GCCAGAACCC ACCTCAGAAA ACTGTTCTTC TCCGTTTGA
 
Protein sequence
MSTPRTIAIY GKGGIGKSFT TTNLSATFAL MGKRVLQLGC DPKHDSTTSL FGGVSLPTVT 
EVFAEKNARN EELQISDIVF RRDIPDFPQP IYGVELGGPQ VGRGCGGRGI ISGFDVLEKL
GLFSWDLDVI LMDFLGDVVC GGFATPLARS LSEEVILLTN NDRQSIFTAN NICQANNYFK
TVGGESKLLG LIINRDDGSG IAEKYAAEAG ITILMKLPHN TAARDKDDSF DFAVRLPEIG
EPFRKLATDI LERKITPCEA AGLDFQTFIR LFGEVNEAHP TPASQDELTG QKQQINGERP
EAAQNDSVSP ESEKLFACIE KLPDSEKEIY RLIEVEKKSA AEAAGIKGIS EAEAQEIFSS
ARTHLRKLFF SV
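
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the compact amino-acid string used for the standard code; translation table 11 shares these codon-to-amino-acid assignments and differs only in the permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and are not part of any tool associated with this record. Only the first and last blocks of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Codon table in T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 39 bases of the gene sequence in this record.
first_block = "ATGTCTACAC CTCGCACAAT TGCCATTTAT"
last_block = "GCCAGAACCC ACCTCAGAAA ACTGTTCTTC TCCGTTTGA"

print(translate(first_block))  # MSTPRTIAIY
print(translate(last_block))   # ARTHLRKLFFSV
```

Both fragments reproduce the start and end of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the rounded GC content reported in the record.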