Gene GBAA_4078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4078 
Symbol 
ID2819763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp3754950 
End bp3756932 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content35% 
IMG OID637790788 
Productphage minor structural protein 
Protein accessionYP_020721 
Protein GI47529372 
COG category 
COG ID 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACAC CAAGCGGAAT TCTTCATGTT GTTGACTTTA AAACAGATCA AATTATATCC 
GTTATTCAAC CAAAAGATTA TTGGGATGAT AGACGTCAAT GGGAACTTAA AAACAATGTA
GACATGCTAG AATTTAAAGT TTTTGATGGA ACACCTGAAG CTATTACATT ACAACAGCAA
AATTTGATTT TAAAAGAAGT GCGTGATGGA CGCATTGTAC CATATGTAAT TAATAATGAG
GTTGAAAGAG ATTCTAATGA TAAATCTGTT ACGGTTCATG CTTCTGGAGC TTGGGTCCAA
ATAGCAAAAG AGGGTTTTAT CAGTCCGCAA CGTATAGAGA GCAAAACGGT TAACGAATTT
ATAGACTTGG CACTCGTAGG GATGAAATGG AAACGTGGTA TTACTGAGTA TGCAGGATTC
CATACAATGA CGATAGATGA GTTTATTGAT CCTCTTACTT TCTTAAAGAA CATTGCAGCG
TTGTTTAAAT TAGAAATCCA ATATCGTGTT GAGGTTAAAG GATCACGAAT TATCGGTTGG
TATGTAGATA TGATCCAAAA ACGTGGGCGT GAAACAGGAA AAGAAATAGA ATTAGGTAAA
GATTTGGTTG GCGTAAAGCG TATTGAACAT TCACGAGAGA TTTGTACAGC TTTAATTGGA
TTTGTTAAAG GTGAGGGAGA CAAAGTAATC ACTATCGAAA GCATAAATAA AGGTCTACCT
TATATCGTAG ATGCAGATGC GTTTCAAAGA TGGAATCAAC ATGGGCAGCA TAAATTTGGT
TTCTATACGC CTGAGACAGA TCAGGAAGAC ATGACTCCAC AACGTCTTAT GACGCTTATG
AAATTGGAAC TAGCAAAGCG TGTAAATACA TTTGTATCTT ATGAAGTTGA AGCACAATCA
CTTGGCAGGG TGTTTGGTTT GGCTCATGAA TTGATTGATG AAGGGGATAT AATTCGAATT
AAAGACACAG GCTTTACACC TAAATTATAT TTAGAAGCTC GGGTTATTGC TGGTGATGAA
TCTTTTACTG ATCCTACACA AGATAAGTAT ATGTTTGGGG ATTATCGTGA GATTATTGAT
CCAAGTGAGG AATTACGAAA ATTATACAAT AAAATTCTCG GCTCATTAGG TAGTAAAGCA
AGTAAAGAAA CCCTTGAACA ACTTGAAAAA CTAGCAAAAG AAGCAAAAGA AACAGCTAAC
AATTCTAAAC AGACAGCTGA CGATGCGTCC GCAGCAGCGC AAATAGCAAA GGATATCGCA
GATGCTGTTT TAATAAAACA AAAAGATTTC CAAACGAAAA TTATAAAAAG TGCTACACCA
CCCTCTAATC CTACTAAGGA TTTGACACTA TGGTTAGACA TAAGTAACCC GGAAAAGCCA
ATCCTTTATC TGTGGAATGG AACGAAGTGG GATAGATTAA CTCCTGATAC TTCTATCATA
GACGCTGATA TAAAAGGTAT TGAGGATGAG GTTGAGAAAC TCCAAACTGA AGTTGGCTCT
AAAGTTAATC AACAATGGGT TAAGGACCAA ATTCAAACTG ACATCCAAAA TAAGGCTGAT
ATCAAAGATG TTTACAAGAA AACTGAAATC GATAAAGCTT TAGAGGGTCA TGTTAAAGTA
CAGTCGTATG AGATTGATAA AAAGGCTTTG CAGGAAGGTA TAGCAAATAA TGCAAACATA
ATTAATACAA ATGACAAGAA CTATATCAAA CGTTTTACTG AAAATGAATC TAAGATTACT
CAAACAGAAA AAGATATCAA AACACAAATT GAAGAATTGA GTGTTACGAA TAAAAAAGTT
GATTCGCAAG GAAATACGAT TGATGAAGTT CAAAAGAAAA CAAATGAAAT TGTTCAAGAT
GCTAATGGGA CAAAGCAAAC GATCACTTCT ATTCAGACTG AGCTAAAAAA TCAATTATCT
ACAGCTAGAA ACATAATAAC TAATTCTAAT TTTAATGACA ATCTTAATAC TTGGAAAAAA
TGA
 
Protein sequence
MRTPSGILHV VDFKTDQIIS VIQPKDYWDD RRQWELKNNV DMLEFKVFDG TPEAITLQQQ 
NLILKEVRDG RIVPYVINNE VERDSNDKSV TVHASGAWVQ IAKEGFISPQ RIESKTVNEF
IDLALVGMKW KRGITEYAGF HTMTIDEFID PLTFLKNIAA LFKLEIQYRV EVKGSRIIGW
YVDMIQKRGR ETGKEIELGK DLVGVKRIEH SREICTALIG FVKGEGDKVI TIESINKGLP
YIVDADAFQR WNQHGQHKFG FYTPETDQED MTPQRLMTLM KLELAKRVNT FVSYEVEAQS
LGRVFGLAHE LIDEGDIIRI KDTGFTPKLY LEARVIAGDE SFTDPTQDKY MFGDYREIID
PSEELRKLYN KILGSLGSKA SKETLEQLEK LAKEAKETAN NSKQTADDAS AAAQIAKDIA
DAVLIKQKDF QTKIIKSATP PSNPTKDLTL WLDISNPEKP ILYLWNGTKW DRLTPDTSII
DADIKGIEDE VEKLQTEVGS KVNQQWVKDQ IQTDIQNKAD IKDVYKKTEI DKALEGHVKV
QSYEIDKKAL QEGIANNANI INTNDKNYIK RFTENESKIT QTEKDIKTQI EELSVTNKKV
DSQGNTIDEV QKKTNEIVQD ANGTKQTITS IQTELKNQLS TARNIITNSN FNDNLNTWKK