Gene Caul_2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2943 
Symbol 
ID5900398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3191441 
End bp3192643 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content70% 
IMG OID641563440 
Productaminotransferase class I and II 
Protein accessionYP_001684568 
Protein GI167646905 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.064626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGC CCGATCCCTG CGCCGGTTCG GCCGAGCCGA CCTATCCCGC GTGGCTGCAC 
CGTCACCTGG ACGAGCAGGC GGCGATCATC CGCGCGCTGG ACCCCTTCCC GTTCGCCCAG
CCGGCCGACA CCACGGGCTT TCATTCGCTG CATCAGAACG ACTACCTGCG CCTGTCCAAT
CATCCGGAGG TCATCCGGGC ACGGACCGAG GCGGCGAGCC GGTCGCGGGT CGATTCGTTC
TCGTCGTTCG TGTTCGGCGG GGCGGCGGCG GAGCACGAAA CCTTCGCCGC CCTGCTGCGC
GAGGCCCTGC AGGCCTGCCA GGTCATCGCC ACGACGGCAG GCTGGACGGC CAATGTCGGG
CTGATCGAGG CGATCGCCGC GCCGGACGTG CCGATCTATG TCGACGCCGA GGCCCACGCC
TCCTTACTGG ACGGGGTGCG GCTGTCGCTG GGCCGGCGCC TGCTGTTCCG CCACAACGAT
CCCCAGCACC TGGAAGACCG CATCGCCATC CACGGTCCGG GCATCGTGAT CATCGACGCG
CTCTACAGCA CCGACGGCAC CTTGGCCGAC CTGCCGCGCT TCGTCGCCAT ATGCGAGCGC
CACGAATGCA CCCTGATCCT CGACGAGGCC CATTCGTTCG GCATGTTCGG CGAGGCCGGC
GGCGGGTTGG CGGTGGCCTG CGGCCTGGCC CATCGGGTCC ATTTCCGCAC CCTGAGCCTG
AGCAAGGCGC TGGGCGGCCA CGGCGGCGCC ATCGCCTGCG GCGCCCAGAT CGCCCCGGCC
CTGTGGAGCC GTCTGCGCCC GGTGATCTTC AGCTCGGCCA CCTCGTCCAT CCTGGCCGCC
GCCCACGCCA AGGCCCTGGA ACTGACCATG ACCGACCGTC GGCGAGCCGA ACATTGCCAG
GCCATGGCCA CCCTGCTGCG CGACCGGCTC AACGCCAGCG GCATCGACAC CCTGGGCAGC
GCCAGCCAGA TCATCTCGAT TAAGCTGCAG GGCGGCGACG CGGCCAAGCT CTACGGCGCC
CTGCGCGAGC GCGGGGTTCT GACCTCGGTG TTCATCTACC CAGCCGTGCA GATGGGGATC
AGCCTGGTGC GCCTTTCGGT GCACGCCGAG GTCACCGAGG CCGACGTCGA CTATGTCAGC
GCGACCATCG TCGAGAGCCT GGAGGCCCTG GGCCTGGGCA TGGACGGGAG GGTCGCGGCA
TGA
 
Protein sequence
MIAPDPCAGS AEPTYPAWLH RHLDEQAAII RALDPFPFAQ PADTTGFHSL HQNDYLRLSN 
HPEVIRARTE AASRSRVDSF SSFVFGGAAA EHETFAALLR EALQACQVIA TTAGWTANVG
LIEAIAAPDV PIYVDAEAHA SLLDGVRLSL GRRLLFRHND PQHLEDRIAI HGPGIVIIDA
LYSTDGTLAD LPRFVAICER HECTLILDEA HSFGMFGEAG GGLAVACGLA HRVHFRTLSL
SKALGGHGGA IACGAQIAPA LWSRLRPVIF SSATSSILAA AHAKALELTM TDRRRAEHCQ
AMATLLRDRL NASGIDTLGS ASQIISIKLQ GGDAAKLYGA LRERGVLTSV FIYPAVQMGI
SLVRLSVHAE VTEADVDYVS ATIVESLEAL GLGMDGRVAA