Gene BAS4380 details


Gene Information

Locus tag: BAS4380
Symbol:
ID: 2851697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4292464
End bp: 4293789
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 34%
IMG OID: 637507617
Product: major facilitator family transporter
Protein accession: YP_030627
Protein GI: 49187375
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
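
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4292464
END_BP = 4293789
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4293789 - 4292464 + 1 gives 1326 bp, and 1326 / 3 - 1 gives 441 aa, which agrees with the listed values.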


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000603608
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAAAG ATATAAGTAA CCATTCAAAA TGGTTTGTTT TCACATTATG TTTTATCGTT 
TTATTAGGAC CGGTGAATGC GGTTTTATTT AATGTTGCGT TAGAGGATAT GGCTCATGAT
TTATCCATTA GTCAATCGAA AGTAAGTTGG GTTGTAGTAG GTTACTCCTT AGTTGTCGGT
ATTGGTTCGA TGATATATGG GAAACTGGCT GATCGTTATA GTGTGAAAAA ACTATTAATT
ATTTCAATCA TCATATTTGT AGCGGGTTCT ATTATTGGAT TTGTAAATCA ATCATATGCG
ATTGTCATTT TTGCAAGATT AGTGCAGGCG AGTGGGGGCG CGGCGTTTAT TGCGCTTAGT
ATGATTGCGG TAGCAAAATT AGTTGCTCCT GCTAAGAAGC CTGGTGCTTT AGCGATGATT
AGTTCTTCTA TTGCGTTAGC GGTTGGTATT GGTCCTTTAG TTGGTGGGGC TATTACAAAT
ACACTAGGGT GGCCATATTT ATTTTTATTT ATGATTATCT CAGTATTGGG GATTTTCTTG
CTTATAAAAT TTATGCCAGG AGAAGCGCAT CATACGGATG AAGTGTTTTA TTTTGATTAC
ATTGGAGCGG CGTTACTATT TGTATTTATT ACGACTGTTT TGGTAGGTGT TAATATGAAT
AGTTGGCTAT TTGTGTTATC GATAATTTCC TTATTTTTAT TCACGGTTCG TATGAAGAAA
GCGGAGCACC CATTTATCGA TATTGAGTTA TTTTCGAACA AAGCATTTCT TCGTTTAATA
ACAGTCGGAT TTATAATTAA TGTGGCGTTA TGTGCTAATT TATTATTATT GCCATTACTG
TTAGGAAGAG TACACGGATT GTCGCCGTTT ATTATCGGAA TTGTATTATT TGTTGCATCA
CTTTTCGGTA TTGTGTCTAG TTTTATTACT GGAAAGATTA TCCCTTCGTT TGGAAATGTG
AATATGATTT ATGTAGCGTC TGTCATTATG ATTGTTGGCT TTTTAATTTT GGGGTTTATT
CCGAATGGAA GTATAGTCGT TATTGTATTG GCGATTATTT TAACGTTTAT GAGTTATTCT
GCCATTCAAG TATCATTGAA CACATTTATA CCGAAAACAT TACATGTAGC TAAAGTTGGA
GTCGGTCTTG GTTTATATAA TTTAATTAAC TTTTTCGGTA TGGCATTTGG ACCAGCTGTA
GCAAGCCGAA TTATGGAATC TACAAATAGT TATCGTTTTA ATTTTATTTT AATCGTCATG
TTAATTTCTG CTCATTTCTT CTTATTAATA GGAATGTCTT CTTTCCGAAA AAAGATGGAG
CATTAA
 
Protein sequence
MGKDISNHSK WFVFTLCFIV LLGPVNAVLF NVALEDMAHD LSISQSKVSW VVVGYSLVVG 
IGSMIYGKLA DRYSVKKLLI ISIIIFVAGS IIGFVNQSYA IVIFARLVQA SGGAAFIALS
MIAVAKLVAP AKKPGALAMI SSSIALAVGI GPLVGGAITN TLGWPYLFLF MIISVLGIFL
LIKFMPGEAH HTDEVFYFDY IGAALLFVFI TTVLVGVNMN SWLFVLSIIS LFLFTVRMKK
AEHPFIDIEL FSNKAFLRLI TVGFIINVAL CANLLLLPLL LGRVHGLSPF IIGIVLFVAS
LFGIVSSFIT GKIIPSFGNV NMIYVASVIM IVGFLILGFI PNGSIVVIVL AIILTFMSYS
AIQVSLNTFI PKTLHVAKVG VGLGLYNLIN FFGMAFGPAV ASRIMESTNS YRFNFILIVM
LISAHFFLLI GMSSFRKKME H
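
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup with two simple sanity checks. The helper names are hypothetical, and the start-codon test only looks for ATG, which is a simplification (translation table 11 also permits other start codons).

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (spaces and line breaks) and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def looks_like_cds(dna: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in stops


if __name__ == "__main__":
    # Paste the full "Gene sequence" block between the quotes to check it;
    # only the first and last fragments of the record are shown here.
    fragment = """ATGGGGAAAG ATATAAGTAA
    CATTAA"""
    dna = clean_sequence(fragment)
    print(len(dna), round(gc_percent(dna), 1), looks_like_cds(dna))
```

Run on the full gene sequence, the cleaned length and GC percentage can be compared with the Gene Length and GC content fields of the record.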