Gene Caul_1065 details


Gene Information

Locus tag: Caul_1065
Symbol:
ID: 5898520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1125203
End bp: 1126675
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 71%
IMG OID: 641561547
Product: hypothetical protein
Protein accession: YP_001682693
Protein GI: 167645030
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
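
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 1125203
END_BP = 1126675


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1473 490
```

Both results agree with the Gene Length and Protein Length fields of the record.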


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.961267
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCTGGA AGAGCAAGAC GGCGCTGACC GCCGTGGGCC TGGTGCTGGC GGTCGGGGCC 
GTGGTGGCGG TGCGCACGGC CACCTTCAAG GCGCCGGCGG CGGTCGACAT CTCCAGCGTC
CACCTGGCCT CGGCCCGGCC GTTCGACGTC AACCTGGCCG CCCGTCACCT GGGCGAGGCC
GTGCGCTTCC AGACCGTCAG TCACCAGGAT CCCGCCGAGG ATCAGCCGGC CGAGTGGGAC
AAGCTGCACG CCTGGCTGCA GGCGACCTAT CCCGACGCCC ACCGGGTGAT GACCCGCGAG
GTCGTGGCCG GCCACGCCCT GGTCTACACC TGGAAGGGCT CCGACCCGTC CCTGGCCCCC
ATCGTGCTGA TGGCCCACCA GGACGTGGTG CCGGTGACGC CCGGCAGCGA GGGCGGCTGG
ACCCATCCGC CGTTCGACGG CGTGGTCGCC GACGGCGCGG TTTGGGGGCG CGGCTCGATC
GACGACAAGG GCAGCCTGGT CACCCTGTTC GAGGCGCTGG ACGGCTTGGC CAAGGCCGGC
TTCACGCCCC GGCGCACCGT GATCCTGGTC AGCGGCCATG ACGAGGAGGT GCGCGGCGGC
GGGGCCAAGG CGGCGGCGGC CCTGCTGAAG GCGCGCGGGG TCAAGGCCCA GTTCGTGCTC
GACGAGGGCA TGGTGGTGGT CGAGGACCAT CCGGTGACCA AGGGCAAGGT CGCCCTGATC
GCCACCGCCG AGAAGGGCTA CGCCACCCTC ACCGTCATCG CCCCGGCCGT GGGCGGCCAT
TCCTCGGCCC CGCCGCCCCA GACCGGGGTC GCCACCCTGG CCAAGGCGGT GCTGGCCATC
GCCGACAACC CGTTCCCGAT GAAGTTCAGC GGCCCCGGCG CCGACATGCT GAAAAGCCTG
GCTCCGCACA GCGGGACGGC CATCAAGATG GCGGCGGCCA ATACCTGGCT GTTCTCGCCG
CTCCTGGTGA AGGAGACGTC CAAGACGCCG GCCGGGGCGG CCATGCTGCA CACCACCATC
GCCCCGACCA TGCTGAAGGG CTCGCCCAAG GAGAACGTCC TGCCGCAGGA CGCCGAAGCG
TGGATCAACT ATCGCATCGC GCCCGGCGAC AGCTCGGCCG ACGTGATGGC CAAGGCCAAG
GCGGCGGTCG GCGACCTGCC GGTCAAGCTG GCCTGGGTAA AGGCCCCGGA CGAACCCAGC
AAGGTCTCGT CGACCACCTC GGACGGCTGG AAGACCCTGG CCGCCCTGGC CGGCGACGAA
AGCAAGGCGC CCGTCGCCCC GGCCCTGATG ACCGCCGCCA GCGACAGCCG CTACATGGCC
CCGGTCGCCG ACGACATCTA CAAGTTCCAG CCGCTGCAGC TGTCGGTGAA GGACACCGAG
ATGATCCACG GCACCAACGA GCACATGACG ATCGCCAATG TCGAACGCAT GGTGCGGTTC
TACCAGCGGC TGGTCGAGAC GGCCGCGAAG TAA
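
The GC content field of a record like this one is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted in the space-separated 10-base blocks shown above, and the function name is illustrative. The example runs on the first block only, so the full-gene figure is not reproduced here.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
first_block = "ATGGGCTGGA"
print(f"{gc_content(first_block):.0f}%")  # prints: 60%
```

Passing the whole gene sequence as a single string gives the gene-wide value to compare with the GC content field.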
 
Protein sequence
MGWKSKTALT AVGLVLAVGA VVAVRTATFK APAAVDISSV HLASARPFDV NLAARHLGEA 
VRFQTVSHQD PAEDQPAEWD KLHAWLQATY PDAHRVMTRE VVAGHALVYT WKGSDPSLAP
IVLMAHQDVV PVTPGSEGGW THPPFDGVVA DGAVWGRGSI DDKGSLVTLF EALDGLAKAG
FTPRRTVILV SGHDEEVRGG GAKAAAALLK ARGVKAQFVL DEGMVVVEDH PVTKGKVALI
ATAEKGYATL TVIAPAVGGH SSAPPPQTGV ATLAKAVLAI ADNPFPMKFS GPGADMLKSL
APHSGTAIKM AAANTWLFSP LLVKETSKTP AGAAMLHTTI APTMLKGSPK ENVLPQDAEA
WINYRIAPGD SSADVMAKAK AAVGDLPVKL AWVKAPDEPS KVSSTTSDGW KTLAALAGDE
SKAPVAPALM TAASDSRYMA PVADDIYKFQ PLQLSVKDTE MIHGTNEHMT IANVERMVRF
YQRLVETAAK
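
The protein sequence is the translation of the gene sequence under the translation table listed in the record (table 11). The sketch below is a minimal, dependency-free translator. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which codons may act as starts. The function and variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGGGCTGGA AGAGCAAGAC G"))  # prints: MGWKSKT
print(translate("GCGAAGTAA"))                # prints: AK
```

The two fragments match the first seven and last two residues of the protein sequence listed above.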