Gene BAS3789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3789 
Symbol 
ID2847984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3755323 
End bp3757305 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content35% 
IMG OID637507027 
Productphage minor structural protein 
Protein accessionYP_030040 
Protein GI49186788 
COG category 
COG ID 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACAC CAAGCGGAAT TCTTCATGTT GTTGACTTTA AAACAGATCA AATTATATCC 
GTTATTCAAC CAAAAGATTA TTGGGATGAT AGACGTCAAT GGGAACTTAA AAACAATGTA
GACATGCTAG AATTTAAAGT TTTTGATGGA ACACCTGAAG CTATTACATT ACAACAGCAA
AATTTGATTT TAAAAGAAGT GCGTGATGGA CGCATTGTAC CATATGTAAT TAATAATGAG
GTTGAAAGAG ATTCTAATGA TAAATCTGTT ACGGTTCATG CTTCTGGAGC TTGGGTCCAA
ATAGCAAAAG AGGGTTTTAT CAGTCCGCAA CGTATAGAGA GCAAAACGGT TAACGAATTT
ATAGACTTGG CACTCGTAGG GATGAAATGG AAACGTGGTA TTACTGAGTA TGCAGGATTC
CATACAATGA CGATAGATGA GTTTATTGAT CCTCTTACTT TCTTAAAGAA CATTGCAGCG
TTGTTTAAAT TAGAAATCCA ATATCGTGTT GAGGTTAAAG GATCACGAAT TATCGGTTGG
TATGTAGATA TGATCCAAAA ACGTGGGCGT GAAACAGGAA AAGAAATAGA ATTAGGTAAA
GATTTGGTTG GCGTAAAGCG TATTGAACAT TCACGAGAGA TTTGTACAGC TTTAATTGGA
TTTGTTAAAG GTGAGGGAGA CAAAGTAATC ACTATCGAAA GCATAAATAA AGGTCTACCT
TATATCGTAG ATGCAGATGC GTTTCAAAGA TGGAATCAAC ATGGGCAGCA TAAATTTGGT
TTCTATACGC CTGAGACAGA TCAGGAAGAC ATGACTCCAC AACGTCTTAT GACGCTTATG
AAATTGGAAC TAGCAAAGCG TGTAAATACA TTTGTATCTT ATGAAGTTGA AGCACAATCA
CTTGGCAGGG TGTTTGGTTT GGCTCATGAA TTGATTGATG AAGGGGATAT AATTCGAATT
AAAGACACAG GCTTTACACC TAAATTATAT TTAGAAGCTC GGGTTATTGC TGGTGATGAA
TCTTTTACTG ATCCTACACA AGATAAGTAT ATGTTTGGGG ATTATCGTGA GATTATTGAT
CCAAGTGAGG AATTACGAAA ATTATACAAT AAAATTCTCG GCTCATTAGG TAGTAAAGCA
AGTAAAGAAA CCCTTGAACA ACTTGAAAAA CTAGCAAAAG AAGCAAAAGA AACAGCTAAC
AATTCTAAAC AGACAGCTGA CGATGCGTCC GCAGCAGCGC AAATAGCAAA GGATATCGCA
GATGCTGTTT TAATAAAACA AAAAGATTTC CAAACGAAAA TTATAAAAAG TGCTACACCA
CCCTCTAATC CTACTAAGGA TTTGACACTA TGGTTAGACA TAAGTAACCC GGAAAAGCCA
ATCCTTTATC TGTGGAATGG AACGAAGTGG GATAGATTAA CTCCTGATAC TTCTATCATA
GACGCTGATA TAAAAGGTAT TGAGGATGAG GTTGAGAAAC TCCAAACTGA AGTTGGCTCT
AAAGTTAATC AACAATGGGT TAAGGACCAA ATTCAAACTG ACATCCAAAA TAAGGCTGAT
ATCAAAGATG TTTACAAGAA AACTGAAATC GATAAAGCTT TAGAGGGTCA TGTTAAAGTA
CAGTCGTATG AGATTGATAA AAAGGCTTTG CAGGAAGGTA TAGCAAATAA TGCAAACATA
ATTAATACAA ATGACAAGAA CTATATCAAA CGTTTTACTG AAAATGAATC TAAGATTACT
CAAACAGAAA AAGATATCAA AACACAAATT GAAGAATTGA GTGTTACGAA TAAAAAAGTT
GATTCGCAAG GAAATACGAT TGATGAAGTT CAAAAGAAAA CAAATGAAAT TGTTCAAGAT
GCTAATGGGA CAAAGCAAAC GATCACTTCT ATTCAGACTG AGCTAAAAAA TCAATTATCT
ACAGCTAGAA ACATAATAAC TAATTCTAAT TTTAATGACA ATCTTAATAC TTGGAAAAAA
TGA
 
Protein sequence
MRTPSGILHV VDFKTDQIIS VIQPKDYWDD RRQWELKNNV DMLEFKVFDG TPEAITLQQQ 
NLILKEVRDG RIVPYVINNE VERDSNDKSV TVHASGAWVQ IAKEGFISPQ RIESKTVNEF
IDLALVGMKW KRGITEYAGF HTMTIDEFID PLTFLKNIAA LFKLEIQYRV EVKGSRIIGW
YVDMIQKRGR ETGKEIELGK DLVGVKRIEH SREICTALIG FVKGEGDKVI TIESINKGLP
YIVDADAFQR WNQHGQHKFG FYTPETDQED MTPQRLMTLM KLELAKRVNT FVSYEVEAQS
LGRVFGLAHE LIDEGDIIRI KDTGFTPKLY LEARVIAGDE SFTDPTQDKY MFGDYREIID
PSEELRKLYN KILGSLGSKA SKETLEQLEK LAKEAKETAN NSKQTADDAS AAAQIAKDIA
DAVLIKQKDF QTKIIKSATP PSNPTKDLTL WLDISNPEKP ILYLWNGTKW DRLTPDTSII
DADIKGIEDE VEKLQTEVGS KVNQQWVKDQ IQTDIQNKAD IKDVYKKTEI DKALEGHVKV
QSYEIDKKAL QEGIANNANI INTNDKNYIK RFTENESKIT QTEKDIKTQI EELSVTNKKV
DSQGNTIDEV QKKTNEIVQD ANGTKQTITS IQTELKNQLS TARNIITNSN FNDNLNTWKK