Gene Haur_2871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2871 
Symbol 
ID5734742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3642329 
End bp3643798 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content52% 
IMG OID641280014 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_001545637 
Protein GI159899390 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATACCT ACGAACCACC AATTTTTGAA TTGAGCAGCC CCGGTAAAAT TGGCGCAGCC 
CTGCCCGACG CTGGTGTGCC CGAAACCGAG TTGCCAGCCC ATTTGCTGCG CGACGACGAT
TTAGCAGGTT TGCCCGAAAT TAGCGAAACC GAGGTGGTGC GCCACTTCAC CCGCATCTCA
CAGCGCAACT TCTGTATCGA CACGGGGATG TATCCACTTG GCTCGTGCAC GATGAAATAC
AATCCTAAAA TTCACGAAGA TGCTGCACGG TTGCCTGGCT TTGCCTTTAT TCACCCCTTG
CAAGCCGAAG AAACCGTCCA AGGTGCTTTG CAATTGACCT ATGAATTGCA AAATATCTTG
GCCGAAATTT CGGGCTTTGA TCAAGTTTGT TTGCAGCCAG CCGCTGGCGC TCAAGGCGAA
TTCGCAGGCA TTTTGGTGTT CCGCGCCTAT CACCTTGATC GTGGCGATGA TCAGCGTGAC
GAAGTGTTGT GCCCCAACTC TGCTCACGGC ACGAATCCCG CGACCGCCGC GATGGTCGGT
TTCAAAGTCG TCGAAATTGC TACCGATAGT CGGGGCAATG TTGATTTGGC TGATCTGCGG
GCCAAAGTTG GCCCACGTAC CGCTGGCTTG ATGTTGACCA ACCCCAATAC CTTGGGCTTG
TTTGATGAAA ACGTGCATGA AGTTGCCAAA ATTGTGCATG AAGCTGGTGG CTTGATGTAT
GGCGACGGCG CAAATTTCAA CGCGATTTTG GGGATTGTCA AGCCAGGCGA TGTAGGCTTT
GACTTTATGC ACTACAACTT GCACAAAACC TTTACCACGC CTCACGGTGG TGGTGGCCCT
GGTTGTGGTG CGGTTGGCTG TAAAGAATTT TTGGCCGATT ATCTGCCAGG CCCAATTGTG
GCACTCAAAG AAGGCCAATA TACCCGCCAC ACGCCAGTCA AGAGCATTGG CCGCTTGAAA
GCCTTCAAGG GCAATTTTGG CATGTTTGTG CGGGCCTACA CCTACATTCG CATGCTCGGC
GCGGCTGGCT TGCGCTCGGT CAGCGAACAC GCGGTACTCA ACGCCAACTA TCTACGGGTC
AATTTGGATA AAATTTATCC TGTGGCCTAC GACCGCACCT GTATGCACGA AGTGGTGTTG
CAAGGCAAAA TCAAGGGTGC GCCAAGTGTC CATACCTTGG ATATTGCCAA GCGCTTGATT
GACTTTGGTT TTCACCCACC AACAGTCTAT TTCCCGATCA GCGTGGCCGA ATCGATTATG
ATCGAGCCAA CTGAAACCGA ATCGAAGCGC AACATGGATG CCTTTATTGA TGCCATGAAG
CAAATTGCCC ATGAAGCGGT TGAAAACCCA GAGCTGTTGC ATGCTGCGCC AACCACCGCG
CCAGTACGCC GCCTCGATGA AGCCACCGCT GCCCGCCGCC CAATTCTGAA ATACGATCAA
GCAGCAATCG CGGCTTTGTT AAGCAAATAA
 
Protein sequence
MHTYEPPIFE LSSPGKIGAA LPDAGVPETE LPAHLLRDDD LAGLPEISET EVVRHFTRIS 
QRNFCIDTGM YPLGSCTMKY NPKIHEDAAR LPGFAFIHPL QAEETVQGAL QLTYELQNIL
AEISGFDQVC LQPAAGAQGE FAGILVFRAY HLDRGDDQRD EVLCPNSAHG TNPATAAMVG
FKVVEIATDS RGNVDLADLR AKVGPRTAGL MLTNPNTLGL FDENVHEVAK IVHEAGGLMY
GDGANFNAIL GIVKPGDVGF DFMHYNLHKT FTTPHGGGGP GCGAVGCKEF LADYLPGPIV
ALKEGQYTRH TPVKSIGRLK AFKGNFGMFV RAYTYIRMLG AAGLRSVSEH AVLNANYLRV
NLDKIYPVAY DRTCMHEVVL QGKIKGAPSV HTLDIAKRLI DFGFHPPTVY FPISVAESIM
IEPTETESKR NMDAFIDAMK QIAHEAVENP ELLHAAPTTA PVRRLDEATA ARRPILKYDQ
AAIAALLSK