Gene Caul_2369 details

Gene Information

Locus tag: Caul_2369
Symbol:
ID: 5899824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2571651
End bp: 2572994
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 66%
IMG OID: 641562860
Product: major facilitator transporter
Protein accession: YP_001683994
Protein GI: 167646331
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
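
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1344 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2571651, 2572994)
print(length, protein_length_aa(length))  # prints "1344 447"
```

Both results match the Gene Length and Protein Length fields of this record.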


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAAT CATCCCATCC GGTATCGGCG TTCCGCGCCT GGCTGGCGCT GGCGATCATG 
ATCGCGGCGA TGTTGTATGC GCTCGTGGAT CGTCAGGTGT TCCTGCTGGT CGCCGCCGAG
ATGAGCAAGA CGCTGCGTCT CAGCAACACG CAACTCGGCC TCATCCAAGG TGTGGGCTTC
GCGGCCCTGA CCTTGCTCGG CGCCTATCCG ATCGCCTGGT TCGCCGACCG TTACGATCGA
CGTTGGGTGC TCGCCATTTG CATTCTGTGC TGGGCCATGG GGACGGCGGC TTGCGGTCTT
GCGAACGGCT TTTCGCCCTT GTTCATTGCG GCCGGGGCCG TGGCGGTCTC CGAAGCCGGC
ATCGCCCCGA TCTTCATGTC GATGCTGCCC GAGCTGTTCC GCGGCCAAGC CAGGGTGACC
GCAACCATGA TCTACTATGT CGCGGTCTCC CTGGGCATGG CGGCGGGCAT GTTCGTGGTC
GGCGCCATGA TGGCGGCGGT CGATGCGCTC AAGCCGCTGC CCGGGTTCTT GAGCGGGCTG
GAAAACTGGC GCCTGGCCTA TCTGGCGGCG GCGGCTCCAT TTCCGGTCTT GATCGCGATG
ATCTTCTTCT TGCCGATCGG GCGAGTCCCG GGCGCGCGAG CGAAAGCGAC GGCCGCGCCG
ATCACGCCGT TCTTGCGCGC GCACTTCAAG TCTGTGGCGC TGGTGTTCGG GGCGATGACC
TTCTTCGCCC TGGGCGTGAC GAGCGTTCTG GCTTGGACGC CCGTGTCGCT GACCCGGATC
TTCGGCCTGA GCCCCGCGTC TGTCGGCATG GTGCTGGGCG CGGTGATCGC CGCCGCGAGC
GTCGCCGGGG TCACGGCTGG CAATTTCGTC ATGCCGCCCC TGCAGCGGCG GATCGGCTAT
CGTGCCGCCC CTCGCATCGT CTGGGTGTCG CTGATCGCGT CGCTCCCGCT GGTCTGCCTC
ATTCCCTTCG CGACGGCGCC CTGGCAGGTC TTCGCCTGTG TCGGCGTCCA GGTTTTCGCC
TCGACGATCG CCGGGGCCTC GAGCGTCAGC CTGCTGCAGG ACTTGGCGCC CCCTGAAGTA
CGGTCGCGCA TCATGGCGCT CCGCGCGATG ACCAATGGAC CGGCAATCGG CCTGGGCATA
GCCGGCTCGG CGTTTCTGGG CGACGTCATC AAGGCGGGGC CGCAGAGCCT GTTCTGGGGC
GGCCTGTGCA TAACCGTTCC GGCCTGGATC GCCACCATCG TGATGCTGCG GCTCGCCGAG
AAGCCCTTCG AGGTTACGGC TCGTGAGAGC ACGGGCATGC GCAGCCCACT CGACTTTTCT
GCGCCGACCA AAGACGTCGG TTAG
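
The gene sequence is laid out in blocks of 10 bases, 60 per row, so the spacing has to be stripped before any computation. As a minimal sketch, the hypothetical helpers below join the blocks and compute the fraction of G and C bases, which is the quantity the GC content field reports. Only the first row of the sequence is used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed (6 blocks of 10 bases).
first_row = "ATGCAACAAT CATCCCATCC GGTATCGGCG TTCCGCGCCT GGCTGGCGCT GGCGATCATG"
seq = clean_sequence(first_row)
print(len(seq))          # prints "60"
print(gc_fraction(seq))  # GC fraction of this row only, not of the whole gene
```

Passing the full cleaned gene sequence to `gc_fraction` gives the whole-gene value to compare with the GC content field.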
 
Protein sequence
MQQSSHPVSA FRAWLALAIM IAAMLYALVD RQVFLLVAAE MSKTLRLSNT QLGLIQGVGF 
AALTLLGAYP IAWFADRYDR RWVLAICILC WAMGTAACGL ANGFSPLFIA AGAVAVSEAG
IAPIFMSMLP ELFRGQARVT ATMIYYVAVS LGMAAGMFVV GAMMAAVDAL KPLPGFLSGL
ENWRLAYLAA AAPFPVLIAM IFFLPIGRVP GARAKATAAP ITPFLRAHFK SVALVFGAMT
FFALGVTSVL AWTPVSLTRI FGLSPASVGM VLGAVIAAAS VAGVTAGNFV MPPLQRRIGY
RAAPRIVWVS LIASLPLVCL IPFATAPWQV FACVGVQVFA STIAGASSVS LLQDLAPPEV
RSRIMALRAM TNGPAIGLGI AGSAFLGDVI KAGPQSLFWG GLCITVPAWI ATIVMLRLAE
KPFEVTARES TGMRSPLDFS APTKDVG