Gene Caul_2178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2178 
Symbol 
ID5899633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2365570 
End bp2366760 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content70% 
IMG OID641562669 
Productcarbohydrate-selective porin OprB 
Protein accessionYP_001683804 
Protein GI167646141 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3659] Carbohydrate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.278032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.401422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCAT CTAGACCAAA CTGCCTTATC GCCATGGTCG CGGCCCTGGC CGTCTCCGCC 
CTGAGCTCCG CCGCCCTGGC GCAGACGGCG GGCGAGGGCG CGTGGTCGCA CGAGGTCGTC
TACACCGCGG ACGTCGCCGG CCCGGTGCGC GGCGGGGCGG CCCATGCCGG TCGGGCCTTG
GACAACCTGG ACGTCATCAT CGACGGCGAC CTGGACAAGG CCTTCGGCTG GCGCGGCCTG
GCCGTGCACG GCTACCTGCT CAACAACAGT GGCGGCGCCC CCAACGATAT CGCCGGCACT
CTGCAGGGCG TCGACAACAT CGAGGTCGGG CGTCCCCGGG CGCGGCTCTA CGAGTTGTGG
CTCAAGGCCA GTTTCGCCGG CGACAAGGGC TCGGTGCTGG CCGGGCTCTA TGACCTCAAC
AGCGAATTCT ACTCGACCCA GGCCTCGGGC CTGCTGCTGG CCCCGCCGTT CGGCATTGGC
TCGGAGCTCG CCTCGACCGG CCCCAACGGT CCGTCGATCT TCCCGTCCAC CGCCCTGGCG
GTGCGGATGC GGGTCGAGGG CAAGCAGGGA CGCTACGTCC AGGCCGCCGT GCTCAACGCC
AAGGCCGGGA CGGTGGGCGA TCCGGATGGG CCGGCGACGG AGTTCGATCA CGGCGCGCTG
ATAGTCGCCG AGGCCGGGAT CGGCGCGACA TGGCGGCTGG CGGCCGGGGG CTGGTTCTAC
ACCCAGCGCC AGACGGACCT GCGCGACCTC GACGCCAAGG GCGACCCGGC CCGGAGCCAC
GCGCGCGGCG CCTACCTTCT GGCGGAGTAT CCCTTCGTCG ATGGCGGGGT GAGCGGACGC
TCGGTGCGGG GCTTCGCTCG CCTGGGCCTT TCGGACGGCG ACACCACGGC GTTCCGTTCG
GGCTGGCAGG CCGGCGTGCT GGTGGAGAAG GTTTTCGCCT CGCGCCCCGA CAGCGCCTTC
TCGGTCGGGG TGGAGCAGGG GATGCTATCG TCCAAGCAGC GCGACAACAC CCGCGACGCC
GGTGTCTCCC CGGCCCACGC CGAGTCCAGC ATCGAGATCA CCTATTCAGA CAAGGTCCTG
CCGCGACTCA CCCTGCAGCC GGACGTCCAG TTGATCCGCC GGGCCGGCGG TGATCGCGAC
GCCCGTGACG TGGTGGTCGT GGCCTTGCGG ATGACGATCA GCCTGTTCTA G
 
Protein sequence
MTSSRPNCLI AMVAALAVSA LSSAALAQTA GEGAWSHEVV YTADVAGPVR GGAAHAGRAL 
DNLDVIIDGD LDKAFGWRGL AVHGYLLNNS GGAPNDIAGT LQGVDNIEVG RPRARLYELW
LKASFAGDKG SVLAGLYDLN SEFYSTQASG LLLAPPFGIG SELASTGPNG PSIFPSTALA
VRMRVEGKQG RYVQAAVLNA KAGTVGDPDG PATEFDHGAL IVAEAGIGAT WRLAAGGWFY
TQRQTDLRDL DAKGDPARSH ARGAYLLAEY PFVDGGVSGR SVRGFARLGL SDGDTTAFRS
GWQAGVLVEK VFASRPDSAF SVGVEQGMLS SKQRDNTRDA GVSPAHAESS IEITYSDKVL
PRLTLQPDVQ LIRRAGGDRD ARDVVVVALR MTISLF