Gene GBAA_3037 details

Gene Information

Locus tag: GBAA_3037
Symbol:
ID: 2817544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 2801726
End bp: 2802835
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 33%
IMG OID: 637789839
Product: hypothetical protein
Protein accession: YP_019678
Protein GI: 47528329
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
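
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, though they agree with the listed values); the function names are hypothetical:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2801726, 2802835)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```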


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.384014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAAC AGCTTGTGTA TACACCATTA GCGGAGCCTT TCGTAATGGG AGTTATGTCA 
GTTGTAGCTG TTATTTTATG CGTATTTTCA AGCAATCTAC TGTTTTTATC TCTCGTATTT
TTGTATGTAA TTTTAATAGG TGCTATGCAT GTATACATAC GTAAAGTATC TCGTGTTGAA
TGGGAATATA GCCAAGGGAA TTCAAACGTT TTTATAGGTG AAACGAATAT GTGCAAAATG
AAAATTTCAA ATAAGTCGAT ATTTCCTATT TTCAATATCG TATTTCGATT TAAATGTGAA
AACAAGCTAA CTTGGAATCA TGATGAAATA AACAAAAATA CGAATACAGG TTCAAATTAT
TATATGAATT TTAATTTAAA AGGAGGAGAG TCAGCTTCAT TTCATTTACA AGCTGTAGCG
TTAAAAAGAG GAATTGCGAA ATGGGAAGAA GTTGAAATTG TTATTACGGA TCCTTTTGGA
TTTATAACGA ATCATATAAC ATATAAACAA GTCGATACGC CGTCCTATTT AGTTTTACCA
GCTGTTCCAA AAATGCAAGT CCCTGAATTA CAAGAATGGT CACGAGGATT TCGAAAAGCG
ATGTCTTCAC CCTTATATGA TGAAACGAAA GTAATGGGAG TGAAGTCTTA TGAAAATGAA
GATTTTCGTT CCATCCACTG GAGTGCAACA GCGAAAACAG GGACGATAAC TGCGAAAAAG
TATGAGCGAA CGCAATCAGA TAAATACGCG ATTTATCTCA ACTTGCAAAA TAAAAGTGGC
GTTTCATTGC GAAATGATAT AGAAGAATTA ATTGAATTAA CAGCAGGCAT ATGTAAACAA
CTTCTTATGC AAAACTGTTC ATTTGAATTA TGGATTAATA GTGTAAAGGA TAACGGTTTG
CTACATGTAA AGAATGGTGA TAATCGGAAA CATTTGCAAA ATGTATTAAA AATACTTGCC
TCAATATCGG ATCAAGATAC GCCTGTATCT TCTTCTTATT TTTACACAGC AGGCTTTCGT
CGTAAGGAAC TGGATGCGGT TCCTTTAATT CTTGGTACTT CACCAAAGAA ATATAGTAGA
ACAAATAAAT GGATTGTAAT GAAAGAATAA
 
Protein sequence
MNQQLVYTPL AEPFVMGVMS VVAVILCVFS SNLLFLSLVF LYVILIGAMH VYIRKVSRVE 
WEYSQGNSNV FIGETNMCKM KISNKSIFPI FNIVFRFKCE NKLTWNHDEI NKNTNTGSNY
YMNFNLKGGE SASFHLQAVA LKRGIAKWEE VEIVITDPFG FITNHITYKQ VDTPSYLVLP
AVPKMQVPEL QEWSRGFRKA MSSPLYDETK VMGVKSYENE DFRSIHWSAT AKTGTITAKK
YERTQSDKYA IYLNLQNKSG VSLRNDIEEL IELTAGICKQ LLMQNCSFEL WINSVKDNGL
LHVKNGDNRK HLQNVLKILA SISDQDTPVS SSYFYTAGFR RKELDAVPLI LGTSPKKYSR
TNKWIVMKE
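
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it uses only the Python standard library, the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and it relies on the amino acid assignments of translation table 11 being the same as those of the standard code. Only the first and last sequence rows are used as excerpts.

```python
from itertools import product

# Codon table in TCAG order. Table 11 assigns the same amino acids as the
# standard code; it differs only in which codons may serve as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the first and last rows of the gene sequence.
first_row = "ATGAATCAAC AGCTTGTGTA TACACCATTA GCGGAGCCTT TCGTAATGGG AGTTATGTCA"
last_row = "ACAAATAAAT GGATTGTAAT GAAAGAATAA"

print(translate(first_row))  # MNQQLVYTPLAEPFVMGVMS
print(translate(last_row))   # TNKWIVMKE
```

Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content listed in the record.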