Gene BAS2956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2956 
Symbol 
ID2852147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2926123 
End bp2927553 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content35% 
IMG OID637506200 
Productdeoxyribodipyrimidine photolyase family protein 
Protein accessionYP_029213 
Protein GI49185961 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.13805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATA AAATTATCGT TATGTTTCAA AAAGATTTTC GCTTATATGA TAACCCAGCT 
CTATTTGAAG CGGCTCAGTC CGGTGAGGTT GTTCCGGTAT ATGTACATGA TGAAACTTTT
TCAATGGGAA GTGCGTCAAA GTGGTGGTTA CACCATGCAA TAATAGATGT AAAGAAGCAA
CTTGAGGCAT TGGGCTCTAC TTTAATCATT CGTAAAGGAA GTACGCAAGA AGAAATACTT
TCTCTCGTAG AACAGTTAGG TATAACGGCT GTATATTGGA ATATTTGTTA TGATCCGGAC
AGATTACAAT CTAATCAAAA AATGAAAATG ATGTTAGAAC ATAAAGGTAT GATCTGTAAG
GAATTTAATT CACATTTATT ATTAGAGCCT TGGGTTATTA AAAAGAAAGA TAACACTGAA
TATAAGGTGT TTACGCCTTT TTACAATGCA TTTCAAAAGC AGGTAATACA TAAGCCAATT
AGTAAAGTGC AGAGTATAAA GGGAGGAAAC TCTTTACCAG TAAGCTTATC TGTTTCAGAA
TTACACTTGT TGCCGACTAT ACCGTGGACA TCTCATATGG AATCAATATG GGAGCCTACA
GAAGAAGGGG CATACAAAAC ATGGAAGGAA TTTTTCTCTA GCAAATTGGC CTCTTATAGT
GAAGGAAGAG ATTTTCCAAA TCAAAATGCT CATTCAATGT TGGCGCCTTA TCTTTCATTT
GGTCAAATAT CAGTCAAGCT AATCTATCAT TACTTAATAA ATAAAAGTAC AGAAAGCCAA
TGTAGTCTTT TTGAAAAACA AGTAAATAGT TTTATACGTC AATTAATTTG GCGAGAGTTT
TCTTATTATT TGCTATATCA TTATCCGTTT ACAGCATATA AACCTCTTAA TAAGAGCTTT
GAACATTTTC CGTGGAATAA TGAAGAGGAG TTATTAAGAG TATGGCAGAA AGGTGACACT
GGTTATCCGT TTATTGATGC AGGAATGAGG GAACTGTGGC AAACAGGTTT TATGCATAAT
CGCACAAGAA TGGCTGTAGC CTCTTTTCTT GTAAAGCATT TGTTAATTCC GTGGCAAGAA
GGAGCAAAAT GGTTTATGGA TACACTATTA GATGCTGATA TTGCAAATAA TACAATGGGG
TGGCAATGGG TTGCTGGAAG TGGAGCAGAT GCATCACCAT ACTTTCGTAT TTTTAATCCG
ATCACACAAG GAGAAAAGTT TGATAAAAAC GGAGAGTATA TAAGAAAATG GGTACCAGAA
TTAAAAGATA TGCCTAATAA ATATATACAT AAACCGTGGG AAGCACCTGA GCATATTTTA
CAAAAGGCCA ATATACAGCT TGGTCATACA TATCCTTTGC CAGTCGTTGA TCATAAGGCA
GCACGAGAGA GAGCGCTTTG TGCATATAAA AGTATGAAAG AATTCGTATG A
 
Protein sequence
MQNKIIVMFQ KDFRLYDNPA LFEAAQSGEV VPVYVHDETF SMGSASKWWL HHAIIDVKKQ 
LEALGSTLII RKGSTQEEIL SLVEQLGITA VYWNICYDPD RLQSNQKMKM MLEHKGMICK
EFNSHLLLEP WVIKKKDNTE YKVFTPFYNA FQKQVIHKPI SKVQSIKGGN SLPVSLSVSE
LHLLPTIPWT SHMESIWEPT EEGAYKTWKE FFSSKLASYS EGRDFPNQNA HSMLAPYLSF
GQISVKLIYH YLINKSTESQ CSLFEKQVNS FIRQLIWREF SYYLLYHYPF TAYKPLNKSF
EHFPWNNEEE LLRVWQKGDT GYPFIDAGMR ELWQTGFMHN RTRMAVASFL VKHLLIPWQE
GAKWFMDTLL DADIANNTMG WQWVAGSGAD ASPYFRIFNP ITQGEKFDKN GEYIRKWVPE
LKDMPNKYIH KPWEAPEHIL QKANIQLGHT YPLPVVDHKA ARERALCAYK SMKEFV