Gene CHU_1021 details

Gene Information

Locus tag: CHU_1021
Symbol: gcvP
ID: 4183758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 1169896
End bp: 1172796
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 44%
IMG OID: 638071019
Product: glycine dehydrogenase
Protein accession: YP_677638
Protein GI: 110637431
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
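
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 2901 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1169896, 1172796)
print(length_bp, protein_length(length_bp))  # 2901 966
```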


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAG ATTTAACCCA AAGAGGTTCC TTCCAGGAAA GACACATCGG TACGTCACTT 
TCAGAAAAAG AAACGATGCT GAAAGCTATC GGTGTCGGTT CGCTTGATCA GTTGATTGAT
GAAACTGTTC CGGCAAACAT TCGCCTGGCA AACCCTCTGC AATTGCCGAA GGCATTGAGT
GAAGAAGAAT TTCTGGTTGA ATTCAAAAAA GTTGTTTCTC AAAATGAAAT TTTTAAAACA
TATATCGGTT TGGGGTATTA CAATACCCTT ACGCCAACGG TTATTCTTCG TAATATATTA
GAAAATCCGG GTTGGTATAC CGCTTATACA CCGTATCAGG CAGAGATTGC TCAAGGCCGT
CTGGAAGCCC TGATCAATTA CCAGACAATG GTTTGTGAAT TAACAGGTAT GGAGATTGCA
AATGCTTCGT TGCTGGATGA AGCTACTGCA GCGGCTGAAG CAATGACGCT TATCTACAGT
TTGCGTAAAG GAACGAAAAA GAATGCAACG GTTTTTGTTG TTTCAAAATA TACCTTTCCT
CAAACCATAG ACGTATTGTT AACGCGTTCA GAACCGCTGG GCATTAAATT AGAGATGGTA
GATCCTGCAA CCGCAGACTT TACACGTACA GACATATTTG GCGCATTGGT ACAGTCTCCG
GATTGTAACG GAAATGTTGC GGATTATACA GCACTTATTT CAGCCGCTCA TGCGAACGAT
ACCTTAGTTG CATTCGGTTC GGATCTGTTG AGCCTGACAC TTATTAAATC GCCGGGTGAA
ATGGGCGCAG ATGTTGTGTT CGGTTCTTCA CAGCGTTTTG GCGTGCCAAT GGGTTTTGGT
GGTCCGCATG CAGGTTTTTT TGCTACCAAG GAAGAATACA AACGCCAGCT GCCGGGCCGT
ATCATCGGTG TGTCCATTGA TGCAGAAGGC AAGAAAGCAT ATCGTATGTC TCTACAGACC
CGCGAGCAGC ATATCCGTAG AGAAAAAGCA ACCTCAAACA TTTGTACTGC CCAGGTATTA
TTATCTGTAA TTGCAGCTTC GTATGCTGTA TACCACGGAC CGGAAGGCCT GAAGCAGATC
GCTAACCGTG TACATGGCTT TGCTGCAGCG TTAAGCGCAA TTGTTACAGA ATTTGGTTTT
GAACAAACCA ATGCAGCATT TTTTGATACG ATTTCAATAA AAGCTCCGGG CAAAGGATTA
GCTATAAAAT CTGCGGCAGA GAAACGTAAG ATCAACCTGC GTTATGAAGG TGATTCAGTT
TTTGTAGCGC TGGATGAGTC GGTTACTGTA GCAGATCTGG ATCAGCTGAT TGAAGTGTTT
GCAGAAGCTG CAGATAAAAA AGTAGTAGCT ATTGATATTT CAAATAAGTA CCTGCGTGTT
GAAGGTGCAC TTTCAGGCGC GTTGCTGCGT ACGTCTGCGT ACATGACACA CCCGGTATTC
AATACGTACC ATACAGAGCA TGAAATGCTT CGTTATCTGA AGCATTTAGA AAACAAAGAT
CTTTCGTTAA CGCATTCCAT GATTTCATTG GGTTCTTGTA CCATGAAGCT GAATGCAACA
ACTGAAATGA TTCCGGTAAC CTGGCCGGAA GTAGGGCAGA TCCATCCGTT TGTACCCGCT
TCTCAGGTAA AAGGGTATCA TCAGTTATTC AACAATATTG AATTGTGGTT GTCTGAGATC
ACAGGTTTTG CAGGCGTTTC TGTACAGCCC AACTCAGGTG CTCAGGGTGA ATATGCCGGT
TTGTTAGTGA TCCGCGCATA CCATGAAAGC AGAGGCGATA CACACCGTAA TATTGCATTG
ATACCGACAT CGGCGCACGG TACCAATCCC GCAAGTGCTG TAATGGCAGC TATGCAGGTA
GTGCTTGTTA AATGCGATGA CGCAGGCAAT ATTGATGTTG CCGATCTGAA ACTAAAAGCT
GAACAGCACA GCAAAAACCT ATCGTGTCTG ATGGTTACGT ATCCTTCTAC ACACGGTGTA
TTCGAAGAAA GTATCATAGA AATCTGTGAA GTAATTCACC GTCACGGCGG ACAGGTTTAT
ATGGATGGTG CAAACATGAA TGCACAGGTT GGGTTAACCT CTCCGGGCAG CATCGGAGCG
GACGTTTGTC ACCTGAACTT ACATAAAACC TTCTGCATCC CGCACGGCGG AGGTGGTCCG
GGTGTTGGTC CGATTGGTGT GGCAGCGCAT TTGACACCGT TCTTACCGGG CCATGCTGTT
GTAAAAGCAG GTGGTGAAAA AGCGATCTCG GCTGTTTCTG CAGCGCCATG GGGCAGTGCA
AGCATCCTTC CGATCTCCTA TGCATATATT GCTATGATGG GTAGTGAAGG ACTAACGAAT
GCAACTAAAA ATGCGATCTT AAACGCAAAC TATATTAAAT CCCGTTTAGA GAAATTCTAT
CCGATTTTAT ATACCGGTAC ACACGGACAT TGCGCACATG AATTTATTCT GGATTGCAGA
GCCTTTAAAA CAACTATCGG TATTGAAGTT GAAGATATTG CAAAACGTTT GATGGATTAT
GGTTTCCACG CGCCGACCGT TTCATTCCCG GTAGCCGGAA CATTAATGAT TGAACCAACG
GAAAGCGAAT CTAAACATGA ACTGGATCGT TTCTGTGATG CGATGATTGC CATCCGTGCA
GAAATTGCAG AAATTGAAAA TGGTACAGCG GATAAACTGG ATAATGTATT GAAACATGCA
CCGCATACAG CAAGTGTTAT TACTGCTGAT GCCTGGACAC GTTCATATTC TAGACAGAAA
GCTGCGTATC CGTTGCCTTA TTTAAAAACA ACGAAGTTCT GGCCTTCTGT AAGCCGTATA
GACAGTGCAT ACGGTGACCG TAACCTGTTT TGCAGCTGTA TACCTACAGA GGAATTTGCA
GAAGCGGAGC TGGCAAATTA A
 
Protein sequence
MKIDLTQRGS FQERHIGTSL SEKETMLKAI GVGSLDQLID ETVPANIRLA NPLQLPKALS 
EEEFLVEFKK VVSQNEIFKT YIGLGYYNTL TPTVILRNIL ENPGWYTAYT PYQAEIAQGR
LEALINYQTM VCELTGMEIA NASLLDEATA AAEAMTLIYS LRKGTKKNAT VFVVSKYTFP
QTIDVLLTRS EPLGIKLEMV DPATADFTRT DIFGALVQSP DCNGNVADYT ALISAAHAND
TLVAFGSDLL SLTLIKSPGE MGADVVFGSS QRFGVPMGFG GPHAGFFATK EEYKRQLPGR
IIGVSIDAEG KKAYRMSLQT REQHIRREKA TSNICTAQVL LSVIAASYAV YHGPEGLKQI
ANRVHGFAAA LSAIVTEFGF EQTNAAFFDT ISIKAPGKGL AIKSAAEKRK INLRYEGDSV
FVALDESVTV ADLDQLIEVF AEAADKKVVA IDISNKYLRV EGALSGALLR TSAYMTHPVF
NTYHTEHEML RYLKHLENKD LSLTHSMISL GSCTMKLNAT TEMIPVTWPE VGQIHPFVPA
SQVKGYHQLF NNIELWLSEI TGFAGVSVQP NSGAQGEYAG LLVIRAYHES RGDTHRNIAL
IPTSAHGTNP ASAVMAAMQV VLVKCDDAGN IDVADLKLKA EQHSKNLSCL MVTYPSTHGV
FEESIIEICE VIHRHGGQVY MDGANMNAQV GLTSPGSIGA DVCHLNLHKT FCIPHGGGGP
GVGPIGVAAH LTPFLPGHAV VKAGGEKAIS AVSAAPWGSA SILPISYAYI AMMGSEGLTN
ATKNAILNAN YIKSRLEKFY PILYTGTHGH CAHEFILDCR AFKTTIGIEV EDIAKRLMDY
GFHAPTVSFP VAGTLMIEPT ESESKHELDR FCDAMIAIRA EIAEIENGTA DKLDNVLKHA
PHTASVITAD AWTRSYSRQK AAYPLPYLKT TKFWPSVSRI DSAYGDRNLF CSCIPTEEFA
EAELAN
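
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how one might strip the spacing, estimate GC content, and translate the gene sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The sample is the first three blocks of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; shared by tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks from a sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


sample = "ATGAAAATAG ATTTAACCCA AAGAGGTTCC"
print(translate(sample))  # MKIDLTQRGS
```

The printed residues match the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the listed GC content.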