Gene Cphamn1_1789 details


Gene Information

Locus tag: Cphamn1_1789
Symbol: purU
ID: 6375476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1934979
End bp: 1935908
Gene Length: 930 bp
Protein Length: 309 aa
Translation table: 11
GC content: 48%
IMG OID: 642684282
Product: formyltetrahydrofolate deformylase
Protein accession: YP_001960188
Protein GI: 189500718
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0788] Formyltetrahydrofolate hydrolase
TIGRFAM ID: [TIGR00655] formyltetrahydrofolate deformylase
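
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1934979
end_bp = 1935908

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length implied by the coordinates
print(protein_length, "aa")   # protein length implied by the gene length
```

Under these assumptions the coordinates give the 930 bp and 309 aa reported in the record.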


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00390951
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTATCTG TAGCAAACCC GATCAGCAAG GCGATTTTCG ATGATAACAT TTTAACCCGA 
AGCGATGTGA CGCATACTTC TCAAAAAGCA GTTCTCCTGC TCTCCTGTCC GGACCGCATC
GGGCTCGTTT CACGGATCTC GAATTTTATC TTCGAACGAA GAGGAAATAT TCTCGATCTC
GATGAACATG TGGATATTGC ATCAGGCATG TTTTTTATCA GGGTGTCCTG GAGCAGGGAT
GATGTATCCA TAACGACGGC TGATCTTCAA GGTGCATTCA GTCCGCTCGC CCTGGAGCTG
GGGGCTGACT GGAAAATTTA TGTGATTCCT GAAAAACCGC GCGTGGCTGT GTTTGTCTCC
AGGTATGATC ACTGTCTGCA GGATCTGTTA TGGCGATACA AGACCGGGGA ATTTGCTATG
GAAATCCCCT TGATTATATC CAATCACCGG GATCTGGAGG ATCTTGCCGC ACAGTATTCC
ATCCCTTTTC ATGTGTTCCC GAAAACTCGT GAAAACAAGC TGGAGCAGGA AACGAAGGAA
CTTGAATTGC TCAAGGAAAA CCGTGTCGAC ACGATTGTTC TTGCCCGGTA TATGCAGGTT
CTTTCTCAAC GGTTTGTCGA TGCGTATCCT GACAGGATCA TCAACATCCA TCACTCGTTT
CTTCCTGCCT TTTCAGGCGG CAGTCCTTAT AAACAGGCCT TTGAAAGGGG GGTCAAAATA
ATCGGCGCTA CCAGTCACTA TGTGACCGGA GAACTCGATG AAGGTCCGAT AATCGAGCAG
GATATCATCA GAATCACGCA CAAGGACACT CTCGGCGATC TTATACGAAA AGGTCGGGAC
CTCGAGCGTC TGGTTCTTTC AAGGGCGATC AGTTCGCATG TAGACCACCG GGTTCTGGTA
AACGGCCGTA AAACCATTAT TTTTACCTGA
 
Protein sequence
MLSVANPISK AIFDDNILTR SDVTHTSQKA VLLLSCPDRI GLVSRISNFI FERRGNILDL 
DEHVDIASGM FFIRVSWSRD DVSITTADLQ GAFSPLALEL GADWKIYVIP EKPRVAVFVS
RYDHCLQDLL WRYKTGEFAM EIPLIISNHR DLEDLAAQYS IPFHVFPKTR ENKLEQETKE
LELLKENRVD TIVLARYMQV LSQRFVDAYP DRIINIHHSF LPAFSGGSPY KQAFERGVKI
IGATSHYVTG ELDEGPIIEQ DIIRITHKDT LGDLIRKGRD LERLVLSRAI SSHVDHRVLV
NGRKTIIFT
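
The protein sequence corresponds to the gene sequence translated with translation table 11, in which the leading TTG codon serves as a start codon and is read as methionine. The following is a minimal sketch of that translation, not the tool used to produce this record. The amino-acid string is the standard NCBI ordering for table 11; the `translate` function and its `first_is_start` parameter are hypothetical names, and forcing the first codon to M is a simplification that does not check whether the codon is a valid table 11 start.

```python
# NCBI genetic-code layout: codons are ordered with bases T, C, A, G.
BASES = "TCAG"
# Amino acids for translation table 11 in that codon order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, first_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    If first_is_start is True, the first codon is emitted as 'M'
    (simplifying assumption: it is not validated as a start codon).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        if pos == 0 and first_is_start:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
head = "TTGTTATCTG TAGCAAACCC GATCAGCAAG"
tail = "AACGGCCGTA AAACCATTAT TTTTACCTGA"

print(translate(head))                        # MLSVANPISK
print(translate(tail, first_is_start=False))  # NGRKTIIFT
```

The two printed fragments match the first ten and last nine residues of the protein sequence listed above.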