Gene Caul_4788 details


Gene Information

Locus tag: Caul_4788
Symbol:
ID: 5902250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5174329
End bp: 5175450
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 65%
IMG OID: 641565308
Product: hypothetical protein
Protein accession: YP_001686406
Protein GI: 167648743
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
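
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both included."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
START_BP = 5174329
END_BP = 5175450

span = inclusive_span(START_BP, END_BP)
print(span)                           # compare with "Gene Length"
print(expected_protein_length(span))  # compare with "Protein Length"
```

Under these assumptions the computed span agrees with the listed gene length of 1122 bp, and 1122 / 3 - 1 agrees with the listed protein length of 373 aa.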


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATTC CCGCATCGAT CACCGGTCGC AACGCCCTGG TGTTCCTGGC GGTCGTCGCC 
GGGGGCGCGG CCCTCTACTG GATGCGGGGC ATCCTCACGC CCCTGGCCAT GGCGGTGTTC
CTGGCGGTGA TGATCGACAG CTTCGCCCGC GTGTTGGTGC TGCGCGTCCC GCGCTTTCCC
AGAAGCCTGG CCCTGCCCAC CGCCATCGTC CTGTCGATCG GCATGTTCGC GGCGGCTGTC
TGGGTGGTGA CCTCGAACGG GGCGGGGTTC GTGGGCCAGA TCCGCGACTA CGCGCCGCGC
CTCAATGAAG TCATCGCCAA GATCGCCTCG CTGGTCGGCG TGAAGGTCGC CCCGACCATC
GGCGACCTGA TCAATCAGCT CAATCCCTCG GCCTATGCCG GGGCGGCCGC CCAGAGCCTG
CAGAACTTCG CCTCCAGCGC CATCCTGGTG CTGATCTATC TGGGCTTCAT CATCGCCTCT
CGACGCGGTT TCAACCGCAA GATCGTGGCG CTCTATCCGC ACCATGCCGA ACGTGACGGG
GCGATGCAGC TGTTCCAGCG CATCCGGAAC GGGGTCGAGC AATACCTCTG GATCCAGACC
GTGACCGGCC TGATGATCGC CATCGCCGCC TTCGTGGTCA TGATGCTGCT GCGGCTCGAC
AACGCCCTGT TCTGGGCCTT CCTGATCTTC GTGGCCGCCT ATATCCCGAT CATCGGCGGA
GCCATCGGCT GTATCCTGCC GCCGCTGTTC GCCCTGGTGC AGTTCCCCGA CAGCTTCTGG
CCGGCCCTGA TCCTGTTCGC CGCCCTGGAG CTGATCTTCT TCGTCGTCGG CAACGTCATC
TATCCGCGGA TGCAGGGCGA CAGCCTGAAC ATCGACCCGA CGGTGGTGCT GCTGTCGCTG
GCCGTCTGGG GCGCGCTCTG GGGCGTGACG GGCATGTTCC TGTCGACTCC GCTGACCGTG
GCCCTGATGC TGATCATGGC CCAATTCGAC GGCACGCGCT GGATCGCCAT CCTGCTGTCG
GAAGACGGTA ATCCCAGTGG CGACGGCTTT GACCGCACGT CGCCGGGGAA AAAAAATCCT
TCCGAGTCAA CTTCTCGACA GAAGTCGATC AAGGGGGCTT AA
 
Protein sequence
MAIPASITGR NALVFLAVVA GGAALYWMRG ILTPLAMAVF LAVMIDSFAR VLVLRVPRFP 
RSLALPTAIV LSIGMFAAAV WVVTSNGAGF VGQIRDYAPR LNEVIAKIAS LVGVKVAPTI
GDLINQLNPS AYAGAAAQSL QNFASSAILV LIYLGFIIAS RRGFNRKIVA LYPHHAERDG
AMQLFQRIRN GVEQYLWIQT VTGLMIAIAA FVVMMLLRLD NALFWAFLIF VAAYIPIIGG
AIGCILPPLF ALVQFPDSFW PALILFAALE LIFFVVGNVI YPRMQGDSLN IDPTVVLLSL
AVWGALWGVT GMFLSTPLTV ALMLIMAQFD GTRWIAILLS EDGNPSGDGF DRTSPGKKNP
SESTSRQKSI KGA
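
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first and last rows of the gene sequence are included to keep the example short; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence shown in this record.
first_row = "ATGGCCATTC CCGCATCGAT CACCGGTCGC AACGCCCTGG TGTTCCTGGC GGTCGTCGCC"
last_row = "TCCGAGTCAA CTTCTCGACA GAAGTCGATC AAGGGGGCTT AA"

print(translate(first_row))  # compare with the start of the protein sequence
print(translate(last_row))   # compare with the end of the protein sequence
print(round(gc_fraction(first_row), 2))
```

Both rows begin on a codon boundary because every full row holds 60 bases, a multiple of 3, so each row can be translated independently.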