Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_4078 |
Symbol | |
ID | 2819763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 3754950 |
End bp | 3756932 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637790788 |
Product | phage minor structural protein |
Protein accession | YP_020721 |
Protein GI | 47529372 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACAC CAAGCGGAAT TCTTCATGTT GTTGACTTTA AAACAGATCA AATTATATCC GTTATTCAAC CAAAAGATTA TTGGGATGAT AGACGTCAAT GGGAACTTAA AAACAATGTA GACATGCTAG AATTTAAAGT TTTTGATGGA ACACCTGAAG CTATTACATT ACAACAGCAA AATTTGATTT TAAAAGAAGT GCGTGATGGA CGCATTGTAC CATATGTAAT TAATAATGAG GTTGAAAGAG ATTCTAATGA TAAATCTGTT ACGGTTCATG CTTCTGGAGC TTGGGTCCAA ATAGCAAAAG AGGGTTTTAT CAGTCCGCAA CGTATAGAGA GCAAAACGGT TAACGAATTT ATAGACTTGG CACTCGTAGG GATGAAATGG AAACGTGGTA TTACTGAGTA TGCAGGATTC CATACAATGA CGATAGATGA GTTTATTGAT CCTCTTACTT TCTTAAAGAA CATTGCAGCG TTGTTTAAAT TAGAAATCCA ATATCGTGTT GAGGTTAAAG GATCACGAAT TATCGGTTGG TATGTAGATA TGATCCAAAA ACGTGGGCGT GAAACAGGAA AAGAAATAGA ATTAGGTAAA GATTTGGTTG GCGTAAAGCG TATTGAACAT TCACGAGAGA TTTGTACAGC TTTAATTGGA TTTGTTAAAG GTGAGGGAGA CAAAGTAATC ACTATCGAAA GCATAAATAA AGGTCTACCT TATATCGTAG ATGCAGATGC GTTTCAAAGA TGGAATCAAC ATGGGCAGCA TAAATTTGGT TTCTATACGC CTGAGACAGA TCAGGAAGAC ATGACTCCAC AACGTCTTAT GACGCTTATG AAATTGGAAC TAGCAAAGCG TGTAAATACA TTTGTATCTT ATGAAGTTGA AGCACAATCA CTTGGCAGGG TGTTTGGTTT GGCTCATGAA TTGATTGATG AAGGGGATAT AATTCGAATT AAAGACACAG GCTTTACACC TAAATTATAT TTAGAAGCTC GGGTTATTGC TGGTGATGAA TCTTTTACTG ATCCTACACA AGATAAGTAT ATGTTTGGGG ATTATCGTGA GATTATTGAT CCAAGTGAGG AATTACGAAA ATTATACAAT AAAATTCTCG GCTCATTAGG TAGTAAAGCA AGTAAAGAAA CCCTTGAACA ACTTGAAAAA CTAGCAAAAG AAGCAAAAGA AACAGCTAAC AATTCTAAAC AGACAGCTGA CGATGCGTCC GCAGCAGCGC AAATAGCAAA GGATATCGCA GATGCTGTTT TAATAAAACA AAAAGATTTC CAAACGAAAA TTATAAAAAG TGCTACACCA CCCTCTAATC CTACTAAGGA TTTGACACTA TGGTTAGACA TAAGTAACCC GGAAAAGCCA ATCCTTTATC TGTGGAATGG AACGAAGTGG GATAGATTAA CTCCTGATAC TTCTATCATA GACGCTGATA TAAAAGGTAT TGAGGATGAG GTTGAGAAAC TCCAAACTGA AGTTGGCTCT AAAGTTAATC AACAATGGGT TAAGGACCAA ATTCAAACTG ACATCCAAAA TAAGGCTGAT ATCAAAGATG TTTACAAGAA AACTGAAATC GATAAAGCTT TAGAGGGTCA TGTTAAAGTA CAGTCGTATG AGATTGATAA AAAGGCTTTG CAGGAAGGTA TAGCAAATAA TGCAAACATA ATTAATACAA ATGACAAGAA CTATATCAAA CGTTTTACTG AAAATGAATC TAAGATTACT CAAACAGAAA AAGATATCAA AACACAAATT GAAGAATTGA GTGTTACGAA TAAAAAAGTT GATTCGCAAG GAAATACGAT TGATGAAGTT CAAAAGAAAA CAAATGAAAT TGTTCAAGAT GCTAATGGGA CAAAGCAAAC GATCACTTCT ATTCAGACTG AGCTAAAAAA TCAATTATCT ACAGCTAGAA ACATAATAAC TAATTCTAAT TTTAATGACA ATCTTAATAC TTGGAAAAAA TGA
|
Protein sequence | MRTPSGILHV VDFKTDQIIS VIQPKDYWDD RRQWELKNNV DMLEFKVFDG TPEAITLQQQ NLILKEVRDG RIVPYVINNE VERDSNDKSV TVHASGAWVQ IAKEGFISPQ RIESKTVNEF IDLALVGMKW KRGITEYAGF HTMTIDEFID PLTFLKNIAA LFKLEIQYRV EVKGSRIIGW YVDMIQKRGR ETGKEIELGK DLVGVKRIEH SREICTALIG FVKGEGDKVI TIESINKGLP YIVDADAFQR WNQHGQHKFG FYTPETDQED MTPQRLMTLM KLELAKRVNT FVSYEVEAQS LGRVFGLAHE LIDEGDIIRI KDTGFTPKLY LEARVIAGDE SFTDPTQDKY MFGDYREIID PSEELRKLYN KILGSLGSKA SKETLEQLEK LAKEAKETAN NSKQTADDAS AAAQIAKDIA DAVLIKQKDF QTKIIKSATP PSNPTKDLTL WLDISNPEKP ILYLWNGTKW DRLTPDTSII DADIKGIEDE VEKLQTEVGS KVNQQWVKDQ IQTDIQNKAD IKDVYKKTEI DKALEGHVKV QSYEIDKKAL QEGIANNANI INTNDKNYIK RFTENESKIT QTEKDIKTQI EELSVTNKKV DSQGNTIDEV QKKTNEIVQD ANGTKQTITS IQTELKNQLS TARNIITNSN FNDNLNTWKK
|
| |