Gene Acid345_2559 details

Gene Information

Locus tag: Acid345_2559
Symbol:
ID: 4072203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3023005
End bp: 3024228
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 59%
IMG OID: 637984576
Product: major facilitator transporter
Protein accession: YP_591634
Protein GI: 94969586
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00891] putative sialic acid transporter
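
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3023005
end_bp = 3024228

# With 1-based inclusive coordinates, the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1224
print(protein_length)  # 407
```

Both results agree with the Gene Length and Protein Length fields of the record.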


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACCG CCCCCGGAAG CGCCTCCGGC CTGACCACCT CGCAACGCAC CCATGCAGTT 
CTCGCCGGCT ACCTCGGCTG GACGATGGAC GCGTTCGACT TCTTCGTCGT GGTGTTCATG
CTCGGCACCC TCGCAGAAGC GTTCGCGGTA AAGAAATCCG AAATCGTCTT CACCATGACG
ATCACATTGG CGATGCGTCC GGTGGGCGCG TTCCTGTTCG GTTTGCTGGC GGACCGGTTC
GGACGCCGCG TTCCGTTCAT GGCGAATGTC ATCTACTTCT CGCTGATCGA GGTGCTCTGC
GGCTTCGCTC CGAATTACAA AGTCTTCCTG CTGCTCCGCG CGCTCTACGG TATCGGCATG
GGCGGCGAGT GGGGAATTGG CGCATCGCTG GCGATGGAGA GCATCCCACA GCGTTTGCGC
GGTATGGTCT CCGGTGTCTT GCAAAGCGGC TATTCCGCCG GATACTTGCT GGCGGCGCTC
GCCTATCGTT TCGTTTTTCC AGGTCTCGGT TGGCGATGGA TGTTCTGGAT CGGAGGCATA
CCGGCGGTAT TGGCGTTGTA CATCCGCTGG CATGTGCCCG AATCCGACGC GTGGAAGGAG
CATGCCGCGA ATAAGGTGTC TGACATCATG CGGGTGTTCG CGGGCTATTG GAAATCGTTT
GCATATCTGC TCGTGATGAT GACGCTGTTT ATGTTCCTCT CGCATGGCAC GCAGGACCTT
TATCCTGACT TCCTGAAAAC CGAACACAAT CTCAGCGCCG CATGGGTTTC GTATATTGCG
ATCATCTACA ACATTGGCGC GATTGTCGGG GCGATCATCT TCGGCCTGAT CTCGCAGCGA
ATGGGGCGAC GGAAGGGAAT TGTCTTCGCC CTCTTCCTGT CGTTCCTCAC GATTCCCGCC
TGGGCATTCG GCCACGGCTT GGTCGTGGTT GCGGCCGCGG CGTTCCTGAT GCAAGTCGGC
GTTCAAGGCG CGTGGGGCGT TGTGCCGGTG CACCTGAACG AGCTTGCGCC TGACGCGGCT
CGCGGGCTCG TGCCGGGCTT TGCCTATCAA CTCGGCATCC TGTTTGCGTC CGGTACGAAC
AACATTGAGT ACGCGCTGCG CGATCATTTC GGATATCGCT GGGCGCTTGC CGGGTTTGAA
ATTTTCACCA TCATCAGTCT CGCCATCGTG GTCTGGTTTG GGCGCGAGGC GCACGGCAAA
CAATTCAGCA AATTGAGCAC TTGA
 
Protein sequence
MDTAPGSASG LTTSQRTHAV LAGYLGWTMD AFDFFVVVFM LGTLAEAFAV KKSEIVFTMT 
ITLAMRPVGA FLFGLLADRF GRRVPFMANV IYFSLIEVLC GFAPNYKVFL LLRALYGIGM
GGEWGIGASL AMESIPQRLR GMVSGVLQSG YSAGYLLAAL AYRFVFPGLG WRWMFWIGGI
PAVLALYIRW HVPESDAWKE HAANKVSDIM RVFAGYWKSF AYLLVMMTLF MFLSHGTQDL
YPDFLKTEHN LSAAWVSYIA IIYNIGAIVG AIIFGLISQR MGRRKGIVFA LFLSFLTIPA
WAFGHGLVVV AAAAFLMQVG VQGAWGVVPV HLNELAPDAA RGLVPGFAYQ LGILFASGTN
NIEYALRDHF GYRWALAGFE IFTIISLAIV VWFGREAHGK QFSKLST
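
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own tooling: `translate`, `gc_percent`, and `clean` are hypothetical helper names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). Only the first and last lines of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Start of the first gene sequence line and the whole last line, as shown above.
first_block = "ATGGACACCG CCCCCGGAAG"
last_block = "CAATTCAGCA AATTGAGCAC TTGA"

print(translate(first_block))  # MDTAPG
print(translate(last_block))   # QFSKLST
```

The two outputs match the beginning and the end of the protein sequence listed above. Calling `gc_percent` on the complete gene sequence gives the value to compare against the GC content field.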