Gene GBAA_pXO1_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_pXO1_0035 
Symbol 
ID2820220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007322 
Strand
Start bp31125 
End bp33056 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content39% 
IMG OID637682711 
Productgroup II intron reverse transcriptase/maturase 
Protein accessionYP_016366 
Protein GI47566357 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones77 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCA ATTCTAAGTC CGCGCCAAAG GGGAAGAAAC TAAGACACAA CGAATACTAC 
GGTATTCAAC CTGTCCTAGA TAACTTATAC CAAAAAGCAA CAAAGGGAAA TTCTTTCAAA
AACCTAATGT CTATCATTAT ATCAGACGAA AATATACTTC TCGCCTACCG CAATATTAAG
GGGAACAAAG GAAGTAGAAC TGCAGCTTGT GATAATGTAA ATATAAAGAA TATTGAGGGA
ATGGAACAAA GCTATTTCTT GAATGAAGTT AAAAGACGCT TTCAAAACTA CCAACCGCAG
AAAGTAAGAC GTAAAGAAAT TTCGAAGCCC AACGGACAAA CCAGACCCCT GGGAATACCG
GCTATGTGGG ATAGGATAAT CCAACAGTGC ATCTTACAAG TCATGGAGCC AATCTGCGAA
GCGCACTTCA GTAACCGAAG TTATGGTTTT CGCCCAAACA GAAGTGCTGA ACATGCCCTA
GCAGATGCAT CAGTGCGAGT AAATAAACAA AACCTTACAT ATGTAGTAGA CGTAGACATT
AAAGGGTTTT TTGACGAGGT AAATCACGTC AAGCTCATGC GTCAATTATG GACATTGGGT
ATCCGTGACA AACAACTTCT GGTCATTATC CGAAAAATAC TGAAAGCCCC AGTGCAAATG
CCTGACGGCA CAACAATGTT CCCGACTAAG GGCACCCCAC AAGGCGGTAT TCTTAGTCCT
ATACTTGCCA ACGTCAATCT TAATGAATTC GACTGGTGGA TAAGTAGACA ATGGGAGACA
TTCAAAGCTA AAAAGGTAAA ACCGAGATGC ATGAGGGGAA TTTGGTGCAA TGACGTTGTA
ACGACACAAC TCACCAAAAC TTCCAAGATG AAACCAATGT ACATCGTAAG GTACGCAGAT
GACTTTAAAA TCTTCACAAA CACACGTAGT AATGCGGAGA AAATTTTCAA AGCGACTCAA
ATGTGGTTAG AAGAACGTCT AAAACTGTCT ATCTCAGCCG AAAAGTCTAA AGTAACCAAT
CTGACAAAGC AACAAAGTGA ATTCTTAGGT TTCACCCTCA AAGCTGTAAA GAAAGGTAAA
AAGAAGAACG GCGACACACG ATACATTGCA GTAACACACG TTTCCCCAAA AGCACTGGAA
AAAACAAAAC AAGATTTAGC AAAACAAGTG AGAAGAATAC AGAAAACCCC AAACTCTAAT
GAAACAATTA AGAGAATCAG CATATACAAC AGCATGGTCA TTGGTAAGCA CAACTATTAT
AAAATAGCTA CGCATGCCTC CCAGGATTTC AGTAAAATGA ACCATAATCT TGACCACATA
ATGTATAACA GATTCCCTAA GTCAACAACT GGAGGTAAAA GCAACACAAA TGGATACACG
AATATAGGAG AATATAAAGG AAAAGACAAA GGTATTAAAC CATATCTAAA GTCAAAAATG
ACGAGATTTC TCATGAAACG TCCCATCCTA CCAATCTCCT ATATTCAACA CAAAAACCCG
ATGATGAAAA AGCAAGCCAT TAACAAGTAC ACCGCAGAAG GACGAGCCCT GATACATAAA
AACTTGGCAG ACATAACCGA AGCGGAACTG AAATGGTTAA GAGAAAATCC AGTTATAAAT
GAACGGGCAA CCGTAGAATA CAATGACAAC CGAATTTCTC TTTATGTCGC ACAAAAAGGC
AAATGCAGTG TAACAGGTGA GAAACTCTTA CCTTGGGACA TTCATTGTCA CCATAAGCGA
TTATGGAGTG AAACAAAGGA CGACAGCTAC AAGAATCTTA CCATCATCAA ACCAAGTGTC
CATAGACTAA TACACGCAAC CAAGATAGAA ACCATAAACC AACTACTCAA TGAACTCAAA
TTCAATGAGG AACAGTTAGG CAAACTCAAT AAATTGCGAA AATTAGTCAA AAACGAGGAA
ATCTGTATTT AA
 
Protein sequence
MNGNSKSAPK GKKLRHNEYY GIQPVLDNLY QKATKGNSFK NLMSIIISDE NILLAYRNIK 
GNKGSRTAAC DNVNIKNIEG MEQSYFLNEV KRRFQNYQPQ KVRRKEISKP NGQTRPLGIP
AMWDRIIQQC ILQVMEPICE AHFSNRSYGF RPNRSAEHAL ADASVRVNKQ NLTYVVDVDI
KGFFDEVNHV KLMRQLWTLG IRDKQLLVII RKILKAPVQM PDGTTMFPTK GTPQGGILSP
ILANVNLNEF DWWISRQWET FKAKKVKPRC MRGIWCNDVV TTQLTKTSKM KPMYIVRYAD
DFKIFTNTRS NAEKIFKATQ MWLEERLKLS ISAEKSKVTN LTKQQSEFLG FTLKAVKKGK
KKNGDTRYIA VTHVSPKALE KTKQDLAKQV RRIQKTPNSN ETIKRISIYN SMVIGKHNYY
KIATHASQDF SKMNHNLDHI MYNRFPKSTT GGKSNTNGYT NIGEYKGKDK GIKPYLKSKM
TRFLMKRPIL PISYIQHKNP MMKKQAINKY TAEGRALIHK NLADITEAEL KWLRENPVIN
ERATVEYNDN RISLYVAQKG KCSVTGEKLL PWDIHCHHKR LWSETKDDSY KNLTIIKPSV
HRLIHATKIE TINQLLNELK FNEEQLGKLN KLRKLVKNEE ICI