Gene GBAA_4978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4978 
Symbol 
ID2816723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4517884 
End bp4519293 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content50% 
IMG OID637791646 
Producttriple helix repeat-containing collagen 
Protein accessionYP_021626 
Protein GI47530277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAGGAA ATGGTGGTAA ATCCAAAATA AAAAGTCCAT TAAATTCTAA TTTCAAGATA 
TTGTCAGATC TAGTTGGCCC TACTTTTCCT CCAGTTCCAA CTGGAATGAC AGGGATAACG
GGAAGTACGG GAGCAACGGG AAACACGGGT CCAACGGGAG AAACGGGAGC AACGGGAAGC
GCCGGGATAA CAGGAAGTAC GGGTCCAACG GGAAACACGG GAGGAACAGG AAGCACAGGT
TCAACGGGAA ACACGGGAGC AACAGGAAGT ACTGGGGTAA CAGGAAGCAC CGGGGTAACA
GGAAGTACGG GAGTAACAGG AAGTACTGGG GTAACAGGAA GTACGGGTCC AACAGGAGAA
ACGGGAGGAA CAGGAAGTAC TGGGGTAACA GGAAGTACAG GGGCAACAGG AAGCACCGGG
GTAACAGGAA GTACGGGAGT AACAGGAGAA ACAGGTCCAA CGGGAAGTAC GGGAGCAACG
GGAAACACGG GTCCAACAGG AGAAACGGGA GGAACAGGAA GTACAGGGGC AACAGGAAGC
ACTGGGGTAA CAGGAAATAC GGGTCCAACA GGAAGCACCG GGGTAACCGG AAATACGGGA
GCAACAGGAG AAACAGGTCC AACAGGAAAT ACGGGAGCGA CGGGAAATAC CGGTCCAACA
GGAGAAACGG GAGTGACAGG AAGTACGGGT CCAACAGGAG AAACGGGAGT GACAGGAAGT
ACGGGTCCAA CAGGAAACAC GGGAGCAACA GGAGAAACGG GAGCAACAGG AAGTACTGGG
GTAACAGGAA ACACGGGTTC AACAGGAGAA ACAGGTCCAA CGGGAAGTAC GGGTCCAACA
GGAAGCACCG GAGCAACGGG AGTGACGGGA AACACAGGTC CAACCGGAAG CACCGGAGCA
ACGGGAGCAA CAGGAAGCAC AGGTCCGACC GGCAGCACCG GAACAACAGG AAATACGGGA
GTAACAGGAG ATACCGGTCC AACAGGAGCG ACCGGGGTTA GTACAACTGC AACGTACGCG
TTTGCGAATA ATACATCAGG AAGTGTTATT TCTGTTTTGT TAGGTGGCAC GAATATTCCG
TTACCAAACA ATCAAAATAT TGGACCGGGA ATAACTGTTA GTGGTGGGAA TACTGTATTT
ACAGTTGCGA ATGCAGGGAA TTATTATATA GCCTATACAA TTAATTTAAC AGCAGGCTTA
CTTGTAAGTT CCCGTATAAC TGTAAATGGC AGTCCGCTTG CGGGAACGAT AAACTCCCCG
ACAGTGGCTA CTGGTTCATT TAGTGCAACA ATAATTGCTA GCTTGCCTGC TGGAGCTGCC
GTTAGCTTAC AACTATTTGG AGTAGTTGCG TTGGCTACAT TATCTACGGC AACGCCAGGA
GCTACTTTAA CGATTATTAG ATTGAGTTAA
 
Protein sequence
MEGNGGKSKI KSPLNSNFKI LSDLVGPTFP PVPTGMTGIT GSTGATGNTG PTGETGATGS 
AGITGSTGPT GNTGGTGSTG STGNTGATGS TGVTGSTGVT GSTGVTGSTG VTGSTGPTGE
TGGTGSTGVT GSTGATGSTG VTGSTGVTGE TGPTGSTGAT GNTGPTGETG GTGSTGATGS
TGVTGNTGPT GSTGVTGNTG ATGETGPTGN TGATGNTGPT GETGVTGSTG PTGETGVTGS
TGPTGNTGAT GETGATGSTG VTGNTGSTGE TGPTGSTGPT GSTGATGVTG NTGPTGSTGA
TGATGSTGPT GSTGTTGNTG VTGDTGPTGA TGVSTTATYA FANNTSGSVI SVLLGGTNIP
LPNNQNIGPG ITVSGGNTVF TVANAGNYYI AYTINLTAGL LVSSRITVNG SPLAGTINSP
TVATGSFSAT IIASLPAGAA VSLQLFGVVA LATLSTATPG ATLTIIRLS