Gene Caul_0307 details


Gene Information

Locus tag: Caul_0307
Symbol:
ID: 5897581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 343629
End bp: 344888
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 68%
IMG OID: 641560791
Product: Na+ dependent nucleoside transporter
Protein accession: YP_001681942
Protein GI: 167644279
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
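
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon


# Values taken from the record above.
length = gene_length(343629, 344888)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both results agree with the Gene Length and Protein Length fields.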


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.140644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTCA ACGAGGTCAC ACGCGGCCTG CTGGGCGTGG TCGCCTTCAT CGCCCTGGGT 
TGGCTGCTGT CGGAGAACAA GCGCGCCTTC CCCGTTCGTC CGGTGCTGGT CGGCCTGGCC
TGCCAGATCG GCCTGGCGGT TCTGCTGACC CGAGTTCCCG CCGTCACAGC CGGCTTCGCC
GCCGCCACCC ATGGGGTCGA CGCCATCCAG GCGGCCTCGC GCGCGGGGTC GAGCTTCATG
TTCGGCTATC TGGGCGGCGG TCGCGCGCCG TTCAGCGTCG CCGATCCGGG CGCCGCCTTC
ATCTTCGCCT TCCAGGCCCT GCCGGCCATC CTGCTGGTCG GCGCGCTGTC GGCCCTGCTC
TGGCACTGGC GGATCCTGAT CTGGATCGTC CGCGCAGCCG CCTGGCTGTT TGGCAAGCTT
TTCGGCGTCA GCGGTCCGGT CGGGGTCTCG ACCTCGGCCT GCGTGTTCCT GGGCATGGTC
GAGGCGCCGC TGCTGGTCAA GCCGTTCCTG CCCAAGCTGT CGCGGGGCGA GCTGTTCATC
ATCATGGTCG ACGGCCTGTC GGTGATCGGC GGCTCGATGA TGATCGTGCT CGGCTCGATG
ATCGCGGCCA AGGTGCCGGG CGCGTTCAGC CACCTGCTGA TCGCCTCGCT GATCAGCACG
CCCATGGCCA TCGGCATGGC CCGGCTGATC ATCCCCACCG CCGACCGCGA CATCCGCGAG
CCGATCGACC TGACCAGCCC CTACCGCAGC AGCCTGGAGG CCATGACCTT CGGCACGCTG
GACGCGGTCA AGATGGTGCT GAACATCGCC GGACTGCTGA TCGTCTTCGT CTCGTTGATC
GCCTTGATCA ACATGGGCTT GGCCGCCCTG CCCCACGCGG GACCGCCGCT CACCCTGGGC
TTCCTGCTGG GCAAACTGCT GACCCCGATC GTCTGGCTGA CAGGCGCGCC GATTGGCGAC
CTGCAGACGG TGGGATCGCT TCTCGGCACC AAGGTCGCGG CGAACGAGGT GGTGGCCTAT
AGCGACATGA TGGCCCTGCC GGCCGGGGCC TTGCAGCCCA AGAGCCTGCT GATCCTGACC
TACGCCCTGG GCAGCTTTGG CAATGTCGGC AGCGTGGCCA TCCTGATCGG CAGCCTGTCG
TCCATGGCGC CGGACAAGGT CGGCGAGGTG GTCGAGCTGG GCTTCAAGGC CCTGGCGGCG
GCCTTCCTGA CCACCTGCCT GACCGCCACG ATCATGGGCC TCCTCAGCGC CCTACCCTAG
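
The GC content field in the gene information can in principle be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks shown above; `gc_content` is a hypothetical helper, not a database function.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence: 10 of 20 bases are G or C.
print(gc_content("ATGATCGTCA ACGAGGTCAC"))  # 50.0
```

Applied to the full 1260 bp sequence, the result can be compared with the 68% reported in the record.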
 
Protein sequence
MIVNEVTRGL LGVVAFIALG WLLSENKRAF PVRPVLVGLA CQIGLAVLLT RVPAVTAGFA 
AATHGVDAIQ AASRAGSSFM FGYLGGGRAP FSVADPGAAF IFAFQALPAI LLVGALSALL
WHWRILIWIV RAAAWLFGKL FGVSGPVGVS TSACVFLGMV EAPLLVKPFL PKLSRGELFI
IMVDGLSVIG GSMMIVLGSM IAAKVPGAFS HLLIASLIST PMAIGMARLI IPTADRDIRE
PIDLTSPYRS SLEAMTFGTL DAVKMVLNIA GLLIVFVSLI ALINMGLAAL PHAGPPLTLG
FLLGKLLTPI VWLTGAPIGD LQTVGSLLGT KVAANEVVAY SDMMALPAGA LQPKSLLILT
YALGSFGNVG SVAILIGSLS SMAPDKVGEV VELGFKALAA AFLTTCLTAT IMGLLSALP
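
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which start codons are permitted, and this gene starts with ATG). The `translate` function is illustrative, stops at the first stop codon, and ignores whitespace and any trailing incomplete codon.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence up to, but not including, the first stop."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGATCGTCA ACGAGGTCAC A"))  # MIVNEVT
# Last four codons, ending in the TAG stop codon.
print(translate("GCCCTACCCTAG"))             # ALP
```

The two fragments match the start (MIVNEVT) and end (ALP) of the protein sequence listed above.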