Gene BAS1531 details

Gene Information

Locus tag: BAS1531
Symbol:
ID: 2848689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 1561565
End bp: 1562785
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 36%
IMG OID: 637504785
Product: hypothetical protein
Protein accession: YP_027798
Protein GI: 49184546
COG category:
COG ID:
TIGRFAM ID:
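
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1561565, 1562785)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Under these assumptions, 1221 bp corresponds to 407 codons, of which 406 encode amino acids and one is the stop codon.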


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AATTATTATT ATTGTCGGGA ACAGGAATTT CTCGTTTAGG TGATTTTATG 
TATCTTATCG CCTTAAACGT AATGGTGTTA CATAGCACAA ACTCCCCTGC TGCTGTAGCA
GGATTATGGA TTGTAGGTCC AATTGCCACC GTATTTACGA AAATATGGTC TGGTAGCATA
GTAGATCGAT TAAACAAACG GTCCATTATG CTTATCACAG ATATCATTCG GGCAGCTCTC
ATTGGCTGTA TACCACTATT TGATTCTATT TGGGCCATTT ACATTTTTAT CTTTTTGACT
CGCATTGCTA CATCATTTTT CGATCCGGCT TCATTTTCTT ATAAAACAAT GCTCATACGT
GCTGAAGAAC GCGCGCAATT CAACGCTTGG AGTAACTTTT GTACAAGCGG AGCTTTCATT
ATCGGTCCAG CTCTTGCTGG AATACTTCTC ACCACGCACT CAGCAACCTT TGTTATTTAC
TGCAACTCAC TTTCCTTTCT ACTTTCTACC ATTTTCATTT ACTTCTTGCC AAACATTGCA
TTACAAACAA AGCAAAACGA AGAAGTTGCA AATACTTTCG TACAAACATT ACGAAATGAT
TGGAAACAAG TTTTTTCATT TGCTCGAACA GAAACTTACA TTATTCTCAT ATTCGTTTTA
TTTCAAGCGA CTATGCTTGT CGCTATGGCA CTCGATTCAC AAGAAGTTGT TTTCACAAAA
CAAGTACTGC TTTTATCCAA CATGGAATAT AGCATGCTTG TTAGTATAAC TGGTGCAGCT
TACGTTTTCG GTTCATTTCT TGTTTCTCTC TTTGCTAAAC GATTACCGAT TCAATATTGT
ATCGGATTCG GTATGATTTT TACAGCAATA GGCTATGTAA TTTTCGCTTT TTCAAATTCA
TTTATCGTCG CAGCAGGCGG TTTTATTTTA CTTGGAGTGT CTTCATCATT TGCTGGTACT
GGCTTTATAA CATTTTATCA AAATAACATA CCTGTACATA TGATAGGACG TATTGATAGC
GTGTTTGATT CCATAAAAAG TTTTATCCAA GTCTTTTTCA TTTTAGCAAT TGGAGCATCC
GCACAATTTC TTTCCGTCCA AATTACTGTA ATAAGTAGCT CGTTACTCAT TCTTTTCCTT
TCCTGTTTAT TAGCAATCCG GGTAATGACT CCTTCACGTG AAAAATATTT TAAAGCGACA
GAGTCATCAT TGGAATACTA A
 
Protein sequence
MKNKLLLLSG TGISRLGDFM YLIALNVMVL HSTNSPAAVA GLWIVGPIAT VFTKIWSGSI 
VDRLNKRSIM LITDIIRAAL IGCIPLFDSI WAIYIFIFLT RIATSFFDPA SFSYKTMLIR
AEERAQFNAW SNFCTSGAFI IGPALAGILL TTHSATFVIY CNSLSFLLST IFIYFLPNIA
LQTKQNEEVA NTFVQTLRND WKQVFSFART ETYIILIFVL FQATMLVAMA LDSQEVVFTK
QVLLLSNMEY SMLVSITGAA YVFGSFLVSL FAKRLPIQYC IGFGMIFTAI GYVIFAFSNS
FIVAAGGFIL LGVSSSFAGT GFITFYQNNI PVHMIGRIDS VFDSIKSFIQ VFFILAIGAS
AQFLSVQITV ISSSLLILFL SCLLAIRVMT PSREKYFKAT ESSLEY
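
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. The sketch below is an illustration, not the database's own method. It builds a codon table from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first 60 bases of the gene are used as input.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence, copied from the record above.
prefix = "ATGAAAAATA AATTATTATT ATTGTCGGGA ACAGGAATTT CTCGTTTAGG TGATTTTATG"
print(translate(prefix))  # MKNKLLLLSGTGISRLGDFM
print(f"GC content of prefix: {gc_content(prefix):.0%}")
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the listed protein sequence and GC content to be compared against the record.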