Gene GBAA_pXO1_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_pXO1_0023 
Symbol 
ID2820337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007322 
Strand
Start bp22322 
End bp24031 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content36% 
IMG OID637682699 
Producthypothetical protein 
Protein accessionYP_016354 
Protein GI47566345 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones72 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAT CATTTGTGAT ACGTAATCAA CGAAGCTTAA ATGGGGAAAC AACAAACGCA 
GTTTTAGTAA CTCTAAAATC TTGGCAAAAG TTTGTGGAAA CATTTGAGGA AAAGTATTAC
AGCTATAAAG ATACATACTT GGAAGATATT CTGCAGCACA TGACATACGA AGAACGTATT
GCTGACGTAG AAGGATTTGT CGAAAAAGCC AAACCATCTT TTTCACGTCG CGCAATGTCT
ACAATTGCAG AAAAAACAGG AAGAAAGCAT AAATTATATG GCATTACGAA TGCTGATCTC
GAAGTGTTCT TGTACTTACA TAAAAAATGC TATACAAATG GCGTCATTCC AAATGTAACA
ATACATATGA TGTGGGAAGA TTACAAGCAA TATAAAGAAG AGCTTGCATA CATTCAGCAT
TCTCAATTCT ATATCGCTTT GAAAAAGTTA AGCTTACATA ACATTATCTC TATTGAAAAT
GGCTTAGATG GCCGCTATAC AATCAAACTC ACTCATTTTA TGAACGAGGA AACAGAAAAA
GCAAATCCTT ACGTATATAT TAGTCCTGTT GTATTTACAA AGGCTTTCTT TAAGCTATCT
GTAGCAGCTA AGAAACTATT CTTAGACATC GCAATGCAAC AACATACGGA AACAACATTA
AAGCGTTCTT TGGACAAGCA GGACGAAAGA GGAAACAAAA CTCACTTCGG TGGGATGTAT
CGTTTCTTAC ATAAAAAATA TCCGCATCAG ATCCGTACGG TTATCGAGGA ATTAACAACT
GCATTACCAT GTACGGGGAA TCCCCTATTC AAAATTTGCA AAATGCAAAA AGGGGTAAAG
CATACAAAAC GATATACGAC ATTATATCTG TCTATCCATT CAGACTTCTT ATGTTCGAAA
GAAGCTGGAG AAGAACAACA TAGGGATCCA TTTACACCAA AAGCTACCTA TGCTCGAAAA
GCGAAATTTA TAGAAACAGT CCTACAAGAG ATGAATATCG GTGAATTGTC TGCAGACATG
AATAAGTTCA TTCATGTATT AAAGCATACT TGTCATCGTC AAATTCGCAG TGTAATCCGT
GGACTACGTG ACATGGTTGA TCGAAAAGAA GGATATCCAA CGAAAATTGT ATATACATTG
AAGAAACTAT TACATCAAAC TTCTCAATAT CAAATTCTCG ATACAGCAGC AAAAGAAGGT
ATCTACCCTC TTATTGCGCA ACATATACCA AAAGAGAGAA ATAGTGACCG TGAACAAGCT
GTCTTTAATT TTGGTCTACA TTATTCTATG TACTCTCTTC ATAACATTAA GAAGATGTTT
AAGAATGTGC ATGCACTACT GAAGCAGAAG TTTGCGGTAC CAGTGACAGA AGAGTCTTAT
CACCGTAATT ATCTAAAATA CCAGGAAGAA ACGTTATTTA GAAAATATGC ATATGATCAG
GGCGTAAATC TCCATGCATA TATCGCTTTA GAAATCGAAA TGCGTGAAAA ACTAAAAGTT
CGTGGCCATA AAGACCGCAC GATTCCAAGT GATGTACGTG AATGGTTTAT TGAAGAAATT
GATAAGCTAC CGCAAGAACA GCTGCGTGTG ATTGAGCTAC CAAAACAGTT TAATTTACTA
GAGTTTATGC GTACCTTCGA ACGTTTAGTA CGTGCTGGTG TAACAATAAT AGCTCCAGAT
CAGGTGCTAC ACGCAATGGA AATAAAATAA
 
Protein sequence
MGKSFVIRNQ RSLNGETTNA VLVTLKSWQK FVETFEEKYY SYKDTYLEDI LQHMTYEERI 
ADVEGFVEKA KPSFSRRAMS TIAEKTGRKH KLYGITNADL EVFLYLHKKC YTNGVIPNVT
IHMMWEDYKQ YKEELAYIQH SQFYIALKKL SLHNIISIEN GLDGRYTIKL THFMNEETEK
ANPYVYISPV VFTKAFFKLS VAAKKLFLDI AMQQHTETTL KRSLDKQDER GNKTHFGGMY
RFLHKKYPHQ IRTVIEELTT ALPCTGNPLF KICKMQKGVK HTKRYTTLYL SIHSDFLCSK
EAGEEQHRDP FTPKATYARK AKFIETVLQE MNIGELSADM NKFIHVLKHT CHRQIRSVIR
GLRDMVDRKE GYPTKIVYTL KKLLHQTSQY QILDTAAKEG IYPLIAQHIP KERNSDREQA
VFNFGLHYSM YSLHNIKKMF KNVHALLKQK FAVPVTEESY HRNYLKYQEE TLFRKYAYDQ
GVNLHAYIAL EIEMREKLKV RGHKDRTIPS DVREWFIEEI DKLPQEQLRV IELPKQFNLL
EFMRTFERLV RAGVTIIAPD QVLHAMEIK