Gene Caul_4522 details

Gene Information

Locus tag: Caul_4522
Symbol:
ID: 5901983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4893816
End bp: 4895522
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 73%
IMG OID: 641565041
Product: glycosyl transferase family protein
Protein accession: YP_001686140
Protein GI: 167648477
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID: [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
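
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the gene record above.
START_BP = 4893816
END_BP = 4895522
GENE_LENGTH_BP = 1707
PROTEIN_LENGTH_AA = 568


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span is 1707 bp, which is 569 codons, or 568 residues once the stop codon is excluded.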


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.185613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.129654
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTCG AATCTCGCCT GGATGACTGG AGCCGCGGGT GGCGCGGCCC GCTGTTCGCC 
GCGCTGGTGG CCCTGATCGC CGGCCTGCCC GGGCTGTTCG CCATGCCGCC GCTCGACCGC
GACGAGTCGC GCTTCGCCCA GGCCACCGCC CAGATGCTGG AAACCGACGA CATGGTCGTC
ATCCGCTTCC AGGATCAGCC GCGCTTCAAG AAGCCGGTCG GCATCCACTG GCTGCAGGCG
GCCAGCGTCA CCGCCTTCTC GGCCGCCGAG GATCGCGGCA TCTGGGCCTA TCGCATTCCC
TCCCTGCTGG GCGCGATGCT GGCGGCGGCG GCCTGCGCCT GGGGTGCGGC GGCGCTGCTG
GGACCGCGGA CGGGCCTGCT GGCCGGCGGC ATCCTGGGCG CGACCCTCCT GCTGTCGACC
GAGGCCTTCA TCGCCAAGAC CGACGCGGCC CTGTGCGGCT TCACCACCCT GGCCATGGCC
GCCCTGATGC GGATCTACGC CGCCCACCTG AACGGCGGAA CCATCACCCG CTGGACCAAG
CTGGCCTTCT GGGTCGGCCT GGCCATGGGC GTGCTGATCA AGGGTCCGGT CGGGCTGATG
GTCGTGGTGC TGAGCCTGCT GATGCTGGCG CTGTGGGACC GCAAGGCCCG CTGGCTGAAG
GACCTGGGCT GGAGCTGGGG CCTGATCCTG CTGGCGGCGA TCGTCCTGCC CTGGGCCACG
ATGATCACCG TGGCCACCGA CGGGGCCTTC TGGTCGACGG CGGTGGCCGG CGACCTGGCG
CCGAAGCTGG CTGGCGGCCA GGAAAGCCAC GGCGCGCCGT TCGGCAGCTA CGCCCTGGCG
GCCTTCCTGC TGGTGTTTCC CGCCACCCTG CTGTTGCCGG CCGGCCTGGC CCAGGGCTGG
ACCCAGCGCA AGGACGCCGG GATCCGCTTC GCCCTCTGCT GGCTGATCCC CACCTGGCTG
GTGTTCGAGA TCCTGCCGAC CAAGCTGGTC CACTACACCC TGCCCGCCGT GCCGGCCCTG
GCCATGCTGA TGGCCGCCGC CCTGCGCCGC CCCCTGGGCG GGATCTCGCG GGCGATCGGC
GCGGTGCTGT CGACCCTGGC CGGGGTGCTG CTGGCCGGTC TGGTCGGCTA TCTCTATTCG
GCGCATGGCG ATCCGAGCGA CCTGCCCGTG ACGATCCTGA CCGCCCTGCT GTTCCTGGCC
GCCGGCGTCG TCGGGACGAT CCTGATCCTG CGCAAGACCG CCGCCACGGC CCTGGTCGCG
GCCGGGGTCC TGGGTATCCT GGCCCATGGC GCCCTGGTGG GCCTGTTCGT GCCGCGCCTG
GAACCGCTGC TGCTGGCGCC GCGCCTGGAA AAGGCCCTCG AGCGGGCCGA CCTGGCGCCG
CGCGGCGGCG CGCCCGGTCC CGTGGCCGTC ACCGGCTATG CCGAGCCCAG CATGATCTTC
CTGCTGGGCA CCACCACCGA ACTGACCGAC CCGGCCGGCG CCGCCCAGGC CGTCGCCGAA
GGCCGGCCGG CCGTGGTCGA GGGACGCCAG GAGAAGGCCT TCCAGGCCGC CATGGCCGCC
CAGGGCCAGG CCGTTCGCCC CGCCGGCGTG GTCGAGGGCT TCGACTATTC CGATGGCGAC
AAGGAACGGC TGACGCTCTA TCGCGGCGCG CCGATCCGGC CCGATGTCGA AGACGACAGC
GCGGCGCAGC AGGAGACCCG CCCATGA
 
Protein sequence
MTLESRLDDW SRGWRGPLFA ALVALIAGLP GLFAMPPLDR DESRFAQATA QMLETDDMVV 
IRFQDQPRFK KPVGIHWLQA ASVTAFSAAE DRGIWAYRIP SLLGAMLAAA ACAWGAAALL
GPRTGLLAGG ILGATLLLST EAFIAKTDAA LCGFTTLAMA ALMRIYAAHL NGGTITRWTK
LAFWVGLAMG VLIKGPVGLM VVVLSLLMLA LWDRKARWLK DLGWSWGLIL LAAIVLPWAT
MITVATDGAF WSTAVAGDLA PKLAGGQESH GAPFGSYALA AFLLVFPATL LLPAGLAQGW
TQRKDAGIRF ALCWLIPTWL VFEILPTKLV HYTLPAVPAL AMLMAAALRR PLGGISRAIG
AVLSTLAGVL LAGLVGYLYS AHGDPSDLPV TILTALLFLA AGVVGTILIL RKTAATALVA
AGVLGILAHG ALVGLFVPRL EPLLLAPRLE KALERADLAP RGGAPGPVAV TGYAEPSMIF
LLGTTTELTD PAGAAQAVAE GRPAVVEGRQ EKAFQAAMAA QGQAVRPAGV VEGFDYSDGD
KERLTLYRGA PIRPDVEDDS AAQQETRP
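
The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any processing. The following is a minimal sketch of how the GC fraction could be recomputed and the gene sequence translated. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). The names `clean`, `gc_content` and `translate` are illustrative and do not come from the record.

```python
# Codon table built from the NCBI-style layout: bases in the order T, C, A, G.
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon.

    Codons containing anything other than A, C, G or T raise KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    first_blocks = "ATGACGCTCG AATCTCGCCT GGATGACTGG"
    print(translate(first_blocks))  # MTLESRLDDW
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check that the two listings agree. `gc_content` on the full gene sequence can likewise be compared with the GC content field.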