Gene Caul_4387 details


Gene Information

Locus tag: Caul_4387
Symbol:
ID: 5901848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4761110
End bp: 4762222
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 69%
IMG OID: 641564905
Product: glycine cleavage system T protein
Protein accession: YP_001686005
Protein GI: 167648342
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
TIGRFAM ID: [TIGR00528] glycine cleavage system T protein
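
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 4761110
end_bp = 4762222
gene_length_bp = 1113
protein_length_aa = 370

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (remainder 0).
remainder = span_bp % 3
codons = span_bp // 3

# Assumption: the last codon is the stop codon and encodes no residue.
residues = codons - 1

print(span_bp, remainder, residues)  # prints: 1113 0 370
```

Under those assumptions the span matches the listed gene length and the codon count matches the listed protein length.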


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.874222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid Coverage
Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.817765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGACC AAGATCTCAA GAAGACTCCG CTGTACGACG CGCACGTCGC GGCTGGCGCC 
CGCATGGTGC CGTTCGCCGG CTATTCCATG CCGGTGCAGT ACAAGGACGG GGTGCTGAAG
GAGCACCTGT GGACCCGCGA GCACGCGGGC CTGTTCGACG TCTCGCACAT GGGTCAGGCC
CGGCTGCGTG GCGCCAATCC CGCCAAGAGC TTCGAGAAGC TGGTCTCGGC CGACTACCAG
GGCCTCAAGC CGGGCAAGCA GCGCTATGCG GTGCTGCTGA ACGATCAGGG CGGGGTGATC
GACGACCTGA TGACGGCGCG TCCCGACGAC GACGGCCTGT TCATCGTCGT CAACGGCGCC
TGCAAGGACA ACGACTACGC CATCATCGCC AAGGCCCTCG AGGGTGAGGC GACCGTGGAA
CGGCTGGAGG ACCGCGCCCT GCTGGCCCTG CAGGGCCCCG AGGCCGCCGC CGTGCTGGCC
GCCCATGTGC CGGAGGCCGC AGGCATGGTG TTCATGGACA CCGCCGCCCT GACCGCCTTC
GGGACCGACG CCATCATCTC GCGCTCGGGC TATACCGGCG AGGACGGTTA CGAGATCTCG
GTGCCGGCCA GCGAGGCCGC GCGCATCTGG AACACCCTGC TGCAGGACGA GCGGGTCAAG
GCGATCGGCC TGGGCGCCCG CGATTCCTTA CGCCTAGAGG CCGGGCTGCC GCTCTACGGC
CACGACATGG ACGAGACGGT TTCGCCGATC GAGGCCGGCA TGCCGTTCGC CGTCGGCAAG
AGCCGCCGCG AGGCCGGCGA TTTCCCTGGC GCGGCGCGGA TCCTCAAGGA ACTGGCCGGC
GACCTCAAGC GCGTCCGCGT CAATCTGAAG GTGCTGGAAG GCGCTCCGGC CCGTGAAGGC
GCGGAAATCG CCGACGAGAC CGGCGCCGTG GTCGGCGTGG TCACCAGCGG CGGCTTCGGC
CCCAGCTATG GCGGCGCCAT CGCCATCGGC TTCGTGCCTC CCGCCCTGGC GGTGGTCGGC
GGGACGCTGA AAGTCATCGT TCGCGGCAAG CCGCAGGCGG CGGAGGTCGT GACCTCGCCG
TTCGTTCCCA CTCGCTACGT GCGCAAAATC TAA
 
Protein sequence
MSDQDLKKTP LYDAHVAAGA RMVPFAGYSM PVQYKDGVLK EHLWTREHAG LFDVSHMGQA 
RLRGANPAKS FEKLVSADYQ GLKPGKQRYA VLLNDQGGVI DDLMTARPDD DGLFIVVNGA
CKDNDYAIIA KALEGEATVE RLEDRALLAL QGPEAAAVLA AHVPEAAGMV FMDTAALTAF
GTDAIISRSG YTGEDGYEIS VPASEAARIW NTLLQDERVK AIGLGARDSL RLEAGLPLYG
HDMDETVSPI EAGMPFAVGK SRREAGDFPG AARILKELAG DLKRVRVNLK VLEGAPAREG
AEIADETGAV VGVVTSGGFG PSYGGAIAIG FVPPALAVVG GTLKVIVRGK PQAAEVVTSP
FVPTRYVRKI
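
The relationship between the gene sequence and the protein sequence can be illustrated by translating a fragment by hand. The sketch below is a minimal, hypothetical helper, not the tool used to produce this record. It assumes that translation table 11 assigns amino acids exactly as the standard code does and differs only in which codons may initiate translation, so the GTG at the start of this gene is read as methionine. The `translate` function and its `starts_at_initiator` parameter are names introduced here for illustration.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, starts_at_initiator: bool = True) -> str:
    """Translate a nucleotide string, ignoring whitespace, up to the first stop."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[cds[index:index + 3]]
        if index == 0 and starts_at_initiator:
            aa = "M"  # an initiator codon such as GTG is read as Met
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the gene sequence and first 20 residues of the protein.
first_60_nt = "GTGTCCGACC AAGATCTCAA GAAGACTCCG CTGTACGACG CGCACGTCGC GGCTGGCGCC"
first_20_aa = "MSDQDLKKTP LYDAHVAAGA".replace(" ", "")

# Last 33 nt of the gene sequence, which end in the TAA stop codon.
last_33_nt = "TTCGTTCCCA CTCGCTACGT GCGCAAAATC TAA"

print(translate(first_60_nt))                            # MSDQDLKKTPLYDAHVAAGA
print(translate(first_60_nt) == first_20_aa)             # True
print(translate(last_33_nt, starts_at_initiator=False))  # FVPTRYVRKI
```

The fragments are kept short so the result can be checked by eye against the listed protein sequence; the same function could be applied to the full gene sequence.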