Gene Caul_4383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4383 
Symbol 
ID5901844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4757265 
End bp4758848 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content70% 
IMG OID641564901 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_001686001 
Protein GI167648338 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.798393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGA ATTCCGTCGG ACGGCCGACC CGTCCCGAAG CCCCCTCCAA GACGGGCGAC 
GTCAACGACT ATGTCCACCC GACCCTGACG GGGGCTCGCG GCCTGTTGCA GGACGAGGCG
CTGATCTTCG AATTGAACGC CTGGGACAAG ACCGGCGTCG ACCTGCCGGA CGCGCCGCCG
GCCGACGACC TGGGCGACCT GGTCCGTAGC GCCCCAATCG GCCTGCCCGG CCTTTCGGAG
CCCGAGGCCG TGCGCCACTA CGTGCGCCTG AGCCAGAAGA ACCACGCCAT CGACCTGGCG
CTGTATCCGC TGGGCTCGTG CACGATGAAG CACAACCCGC GCCTGAACGA GAAGATGGCG
CGCCTGCCGG GCTTCGCCGA CATCCACCCG CTACAGCCGC AGTCGACGGT GCAGGGCGCG
CTCGAGCTGA TGGACCGCCT GGCCCACTGG CTGAAGACCC TGACGGGCAT GCCGGCCGTG
GCGCTCAGCC CCAAGGCGGG CGCCCATGGC GAGATGTGCG GCATGCTGGC CATCCGCGCG
GCCCATGAGG CCAACGGCCA GGGCCACCGC AAGACGGTGC TGGCCCCGAC CAGCGCCCAC
GGCACCAACC CGGCCACGGC GGCCTTCGTC GGCTACACGG TCGTCGAGAT CGCCCAGACC
GACGACGGCC GGGTCGATTT GGCCGACCTG GAGTCCAAGC TCGGCGACCA CGTGGCCGCC
ATCATGGTCA CCAACCCCAA CACCTGCGGC CTGTTCGAAC GCGACGTGGT CGAGATCGCT
CGCCTGACCC ACGCGGCGGG CGCGTACTTC TACTGCGACG GCGCCAACTT CAACGCCATC
GTCGGCCGGG TGCGCCCCGG CGACCTGGGC GTCGACGCCA TGCACATCAA CCTGCACAAG
ACCTTCTCGA CGCCACACGG CGGCGGCGGT CCGGGCGCGG GTCCGGTGGT GCTGAGCGCC
GCCCTGGCCC CGTTCGCGCC CACGCCCTGG GTCGTCAGCC ACGACTACGG CTTCAAGCTG
GTGGAGGAGG CTGCCGGCCC CGCCGCCCAG GCCTTTGGCC GGATGAGCGC CTTCCACGGC
CAGATGGGCA TGTTCGTGCG GGCCTATGCC TACATGCTCA GCCATGGGGC CGACGGCCTG
CGCCAGGTGG CCGAGGACGC CGTTCTTAAC GCCAACTACA TCAAGGCCCG CCTGAGCGAC
CTGATGAGCC CGGCCTTCCC GGACGGCCCC TGCATGCACG AGGCCCTGTT CGACGACGCC
TGGCTGGACG GCACTGGGGT GACCACGCTC GACTTCGCCA AGGCGATGAT CGACGAGGGC
TTCCACCCGA TGACCATGTA CTTCCCGCTG GTGGTGCATG GCGCGATGCT GATCGAGCCG
ACCGAGACCG AGAGCAAGCA AGAGCTGGAC CGCTTCGTCA CCGCGCTGCG CGCCTTGGCC
GGCGCGGCCA GGAGGGGCGA GGTCGAGCGC TTCAAGGGCG CGCCCTTCCA TGCGCCGCTC
AAGCGCCTGG ACGAGACCCT GGCGGCGCGG AAGCCGCGTC TGAGGTGGAA ACCTGTGGCC
GCCGCGCCAC TGGCTGCGGA ATAG
 
Protein sequence
MSMNSVGRPT RPEAPSKTGD VNDYVHPTLT GARGLLQDEA LIFELNAWDK TGVDLPDAPP 
ADDLGDLVRS APIGLPGLSE PEAVRHYVRL SQKNHAIDLA LYPLGSCTMK HNPRLNEKMA
RLPGFADIHP LQPQSTVQGA LELMDRLAHW LKTLTGMPAV ALSPKAGAHG EMCGMLAIRA
AHEANGQGHR KTVLAPTSAH GTNPATAAFV GYTVVEIAQT DDGRVDLADL ESKLGDHVAA
IMVTNPNTCG LFERDVVEIA RLTHAAGAYF YCDGANFNAI VGRVRPGDLG VDAMHINLHK
TFSTPHGGGG PGAGPVVLSA ALAPFAPTPW VVSHDYGFKL VEEAAGPAAQ AFGRMSAFHG
QMGMFVRAYA YMLSHGADGL RQVAEDAVLN ANYIKARLSD LMSPAFPDGP CMHEALFDDA
WLDGTGVTTL DFAKAMIDEG FHPMTMYFPL VVHGAMLIEP TETESKQELD RFVTALRALA
GAARRGEVER FKGAPFHAPL KRLDETLAAR KPRLRWKPVA AAPLAAE