Gene BAS1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1002 
Symbol 
ID2847916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1054544 
End bp1055896 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content36% 
IMG OID637504261 
Producthypothetical protein 
Protein accessionYP_027275 
Protein GI49184023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00204136 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTGC AAACAGAAAC ATACCGCGCA GCTATGAATG GAACATTAGA ACGTCATTTT 
TCAGATATGA TTGCTGTTAT ACCGACTAGA ATTACAATTG AGCAGTTAAA ACAACGGCTA
GAAAATATCG CTACTAAAGT TGATGAGTTA AAAATTGTTT ATAGTGATGA GACAAGCCTT
ATTGTTGAGT TACATATGGA TAATAAAGTC ATACCGTATG AACTGCATAT TGATGAAACG
GATGATCCAG AAGAATACAA ACTATACAAT AGACAAGATT CCACAATCGT AGACCGTTCT
TTTGAAGATG CGGCTTATGG TACTGAAATT TTCACCCGTA CGCTATTTGT AGGCGATGTA
CTGAACTGCT TTTTCCAGCA GTTACAGTTT TTATGGAACC TTGCGCCAGA TTTGTTATTC
GTAATTGATT CAAGTGCAGC AATGAAGGTA ATATCTAGAA ACTATATTGA ATATCACGTT
GAAAATGAAT TATTACCTGA CATTCCTGAC TTGTACGTTA TTCATTCTGT TTATGAAGAC
GATAAAGAAG GCGAGCCTAC GCAATATTGG TTTCATACAC ACGGCCTTTT AAGAGCGGGC
GTAACAGAAA TAGAATTAAT TATTCCAAAT CGCATTTCTT CCTACTATGG CATTGGTGAC
CTCTTTCAAA CATTTGCGAA TAATGCCGTT GAAAATGGGC AAGTTCCTAT GAATGAGCCT
ATCGTTATCG CACATAGTCA GCAAGGTTCT ATACATACAG TAGCTGTGCC GTGGGAAAAA
GGTTTATCTT ATATTGGGCA TAAAACGAGT ATGGATCAAT TATCTTCAAT TGAGGATGAA
GAAGTGAAGC TACAACCAAT AAGTGCACAA AACACATTCT TAGGCGGGAT GGATGACCGA
GATGAATACC ATCAATCGCC ATCTGTTCTC TTGTTCAAAT TTGATACTTC AGAAGAATAT
ATCGAAAGCT TTTTCAAAGA ACACGAGGAA GCTACAGGGC TCATGTTCTA TAAAACAAAT
AGTGAAACGG CTCGTATGGC TTACAATGCG AAGAATACTT TCGGGTATTT CAGCAACATT
TTTCAAATTG AACAATCAAA TGAGGAGTTT CGTTTTCTCG CTAAGTTTGG CGTTTCCTAT
GAAGAGGGTA AAAGCGAGCA TATGTGGTTT GAAATGCAAC ATATTACGGA AGAATTTATT
CAAGGAATAC TCATTAATGA ACCATATTTT ATAGAAGATA TGAGTGAAGG AAATAGTTAT
CATTTAGATT TTGATGACTT AACAGAATGG GTTATTTATG CAGGAGATGC CGTTATAAAG
CCAAATAACT TATATATGTT TATTGGTGAA TAA
 
Protein sequence
MEVQTETYRA AMNGTLERHF SDMIAVIPTR ITIEQLKQRL ENIATKVDEL KIVYSDETSL 
IVELHMDNKV IPYELHIDET DDPEEYKLYN RQDSTIVDRS FEDAAYGTEI FTRTLFVGDV
LNCFFQQLQF LWNLAPDLLF VIDSSAAMKV ISRNYIEYHV ENELLPDIPD LYVIHSVYED
DKEGEPTQYW FHTHGLLRAG VTEIELIIPN RISSYYGIGD LFQTFANNAV ENGQVPMNEP
IVIAHSQQGS IHTVAVPWEK GLSYIGHKTS MDQLSSIEDE EVKLQPISAQ NTFLGGMDDR
DEYHQSPSVL LFKFDTSEEY IESFFKEHEE ATGLMFYKTN SETARMAYNA KNTFGYFSNI
FQIEQSNEEF RFLAKFGVSY EEGKSEHMWF EMQHITEEFI QGILINEPYF IEDMSEGNSY
HLDFDDLTEW VIYAGDAVIK PNNLYMFIGE