Gene Caul_1425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1425 
Symbol 
ID5898880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1513375 
End bp1514715 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content69% 
IMG OID641561912 
ProductNHL repeat-containing protein 
Protein accessionYP_001683053 
Protein GI167645390 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAG CCACCCTGCT TTTCCTGGGA TCCACCCTGG TCCTGTCAGC CTGTGGGGGC 
GGCCCCGCCC TGCCGCCCGA GCAAGGCTAC GGCCCAAGTC CTGTCCTGCC CGCCGCCAAG
AAGCAGGTTC TGCCGGTGAT GAAGATCGCC CCGGCGGTCG GGTGGGCTGA AGGGCAGACG
CCTGAGACCG CACCCGGCTT CGTCGCCACC GCCCTGGTCC GCGGCCTGGA TCATCCGCGC
TGGCTCTACG TGCTGCCAAA CGGGGACGTT CTGATCGCCG AGACCAACGC CCCACCCAAG
CCCGAGGACG GCAAGGGCAT CCGGGGTTGG TTCCAGAAGA TAATCATGAA GCGCGCCGGA
GCCACGCCCG CCTCGGCCAA CCGCATCACC CTGGTGCGCG ACGCCGACAA TGACGGAACG
CCCGAGGCCC GCACTGTCTT CCTGTCGGGC CTGAACTCGC CGTTCGGCAT GGCCTTGGTG
GGCGACACCT TCTATGTCGC CGATAGCGAC GCCCTGCTGG CCTTCCCCTA CCAGCCCGGC
CAGACGCAGA TCACCGCCGC GCCACGCAAG GTCGCCGACC TCCCGGCCGG ACCGATCAAC
CACCACTGGA CCAAGAACGT CATCGCCAGC CCCGACGGAT CCAAGCTTTA TGTGACGGTC
GGCTCCAACA GCAATGTCGG CGAGAACGGC ATGGCGAACG AGGAGCGCCG GGCCGGCATC
CTGGAGATCG ACCCGGCCAC CGGCGCCAGC CGCGTCTTCG CCTCGGGCCT GCGCAATCCC
AACGGCATGG GCTGGCAGCC CCAGAGCGGC AAGCTATGGA CCAGCGTCAA CGAGCGCGAC
GAGATCGGCA ACGACCTGGT CCCCGACTAC ATGACCTCGG TCCAGGACGG CGGCTTCTAC
GGCTGGCCCT ACAGCTACTA CGGCCAGACG GTGGACACGC GGGTCAAGCC GCAAAGGCCC
GATCTGGTGG CCAAGGCGAT CAAGCCCGAC TACGCCCTGG GCGCCCACAC CGCTTCGCTG
GGCCTGACCT TCTACGACGC CGACGCCTTC CCGGCTCGCT ACAAGGGCGG CGCCTTCATC
GGCCAACACG GCTCGTGGAA CCGCAAACCG GTCAACGGCT ACCGGGTGGC CTTCGTGCCG
TTCGCGGGCG GTGTTCCCGC GGGTCCGGCC GAGCCGTTCC TGGTCGGGTT CCTCAACGCC
AAGGGCGAGG CTCTCGGCCG GCCAGTGGGC GTGGCGGTCG ACAGGACCGG CGCCCTGCTG
GTGGCCGACG ACGTCGGCAA TGTGGTCTGG CGGGTGGCCG CCAGCGCCCC GCCGACGCCG
GCCGCGAAGC CCGCGCCCTG A
 
Protein sequence
MLKATLLFLG STLVLSACGG GPALPPEQGY GPSPVLPAAK KQVLPVMKIA PAVGWAEGQT 
PETAPGFVAT ALVRGLDHPR WLYVLPNGDV LIAETNAPPK PEDGKGIRGW FQKIIMKRAG
ATPASANRIT LVRDADNDGT PEARTVFLSG LNSPFGMALV GDTFYVADSD ALLAFPYQPG
QTQITAAPRK VADLPAGPIN HHWTKNVIAS PDGSKLYVTV GSNSNVGENG MANEERRAGI
LEIDPATGAS RVFASGLRNP NGMGWQPQSG KLWTSVNERD EIGNDLVPDY MTSVQDGGFY
GWPYSYYGQT VDTRVKPQRP DLVAKAIKPD YALGAHTASL GLTFYDADAF PARYKGGAFI
GQHGSWNRKP VNGYRVAFVP FAGGVPAGPA EPFLVGFLNA KGEALGRPVG VAVDRTGALL
VADDVGNVVW RVAASAPPTP AAKPAP