Gene GBAA_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_3787 
Symbol 
ID2818678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp3478491 
End bp3480179 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content35% 
IMG OID637790514 
Productprophage lambdaba01, terminase, large subunit 
Protein accessionYP_020424 
Protein GI47529075 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.26855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATAC AATATCTTTT GTCTTATAAC CCAATCATTG AATATTACAG CCTTATTGAA 
TCGGGAAAAG AAATTGTTAG TGAAAAAGTT CGTAGAATAT ATAAGAAATT AGTAAGTGAT
ATTGATGATA AAGAAAGTAT ATATGAATAC GACTCAAAGA AAGCAAACCA TGCCATTGAG
TTCATCGAAA ACTTTTGTAA GCACTCAAAG GGAAAATGGG GCGGAAAACC AATTGTTTTA
GAAGTATGGC AAAAGGCATT TATTGCAGCA GCATTTGGAT TTGTACATGG AATAGATGGC
ACAAGAAAAT ACAGAGAAGT ATTACTTGTA GTTGCTCGTA AAAATGGAAA ATCTACTGTT
GGCTCAGGTA TTGGATTGTA TTTGCAAATA GCAGATGGGG AACCAGGTTC AGAAGTTTAT
GCGGTAGCAA CTAAGAAAGA CCAAGCGAAA TTAGTTTGGT TAGAATCAAA GCGAATGGTA
AAGAAGTCAC CAGCACTATT AAAGCGTATT AAACCTTTAG TATCTGAAAT GGTTTCTGAA
TGGAATGATA GTACATTTAA ACCACTTGGC TCTGATAGTG AAACTTTAGA TGGACTTAAC
GTACATGGTG CTATGATGGA TGAAATCCAT GCTTGGAAAG ATAAAAACTT ATATGACGTT
ATTGTAGATG GTACTTCTTC ACGAGAACAG CCAATGATAT TTATGATTAC AACAGCCGGA
ACTGTCCGAG AGTCAGTGTA TGATATGAAA TATGAAGAAG CAGAAATGTT GTTAAATGGA
CTCGATGATC CGGATGGATA TAAGGATGAT CGTTTTTTAC CTATAATCTA TGAGTTGGAT
AAACGAGAGG AATGGACTGA CCCATCAAAG TGGAAGAAAG CAAATCCTGG GCTTGGATCG
ATAAAAAAGA TAGACCAACT TGAAACAAAA GTAAACAAGG CAAAAGCAAA TTCTTTGTTG
GTAAAAAACT TACTAACGAA AGATTTTAAT ATAAGAGAAA CTTCAACAGA AGCATGGCTG
ACTTTTGAAC AACTAAATAA TCCTGAAACT TTTGATATAG AAAAGCTAAA GCCTTCCTAT
GGAATTGGTG GTTGCGATTT ATCTTCAACT ACCGATTTAA CAGCAGCGAA GGTTATTTTT
ATGGTTCCAG AGGACCCACA TATTTATGTG AAGCAGATGT ATTGGCTTCC AGAAGATTTA
TTGGAGCAGC GAAGTAAAGA AGATAAAATT CCATATAATT TATGGCACGA GCAAGGAATA
TTAAGAACAA CACCGGGAAA TTCCGTTCAT TATAAATTTG TTACGAAATG GTTTTTAGAA
ATACGAGATG AATATGGCAT TTATCTACCT TGGATTGGTT ATGATAAGTG GTCAGCTAAG
TACTGGGTTG AGGAAATGGA AGGTTATTTT GGTAAAGAAT CTATGATTCC TATCGCACAA
GGTAAACAAA CTCTTTCTAG CCCGATGAAA CTTTTAGGAG CTGATTTGGA ATCTAAGTTA
ATAAACTATA ATAACCACAC AATTGACAAG TGGTGTCTTT CCAACACAGC CATAGACGTT
GATAAAAATT TAAATATACA ACCAAATAAA ACAAAGAACC AACGACGTCG TATTGATGGT
ACAGCAGCGC TTTTAAATGC ATATGTAGTT CTTCAAGAAA AACGAAATGA CTACCTCAAC
ATGATTTAA
 
Protein sequence
MRIQYLLSYN PIIEYYSLIE SGKEIVSEKV RRIYKKLVSD IDDKESIYEY DSKKANHAIE 
FIENFCKHSK GKWGGKPIVL EVWQKAFIAA AFGFVHGIDG TRKYREVLLV VARKNGKSTV
GSGIGLYLQI ADGEPGSEVY AVATKKDQAK LVWLESKRMV KKSPALLKRI KPLVSEMVSE
WNDSTFKPLG SDSETLDGLN VHGAMMDEIH AWKDKNLYDV IVDGTSSREQ PMIFMITTAG
TVRESVYDMK YEEAEMLLNG LDDPDGYKDD RFLPIIYELD KREEWTDPSK WKKANPGLGS
IKKIDQLETK VNKAKANSLL VKNLLTKDFN IRETSTEAWL TFEQLNNPET FDIEKLKPSY
GIGGCDLSST TDLTAAKVIF MVPEDPHIYV KQMYWLPEDL LEQRSKEDKI PYNLWHEQGI
LRTTPGNSVH YKFVTKWFLE IRDEYGIYLP WIGYDKWSAK YWVEEMEGYF GKESMIPIAQ
GKQTLSSPMK LLGADLESKL INYNNHTIDK WCLSNTAIDV DKNLNIQPNK TKNQRRRIDG
TAALLNAYVV LQEKRNDYLN MI