Gene Haur_5021 details


Gene Information

Locus tag: Haur_5021
Symbol:
ID: 5736980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 28031
End bp: 29050
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 57%
IMG OID: 641282188
Product: RimK domain-containing protein ATP-grasp
Protein accession: YP_001547779
Protein GI: 159901533
COG category: [H] Coenzyme transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase)
TIGRFAM ID:
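
The start/end coordinates and the two length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon contributes no residue


# Values copied from the record above.
length_bp = gene_length_bp(28031, 29050)
length_aa = expected_protein_length_aa(length_bp)
print(length_bp, length_aa)
```

With the values listed for Haur_5021 this gives 1020 bp and 339 aa, matching the Gene Length and Protein Length fields.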


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.548903
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCTTG TTCTCACCAG TCCAGATGAT ACTCATGCTG ATCGGGTCTG TGATCTGCTG 
GACCAAGCCG ATGCTCCTTG GTTTCGGTTC GATCCGGCCG CCTTTCCCCA CTCTGCCCAG
TTGACCGTGA CCACGGGCGC AACGGGCTTG GTCCAGCGTC TCCTTGTTAC GAGAGATCAT
AGCCTAGATT TGGCGCAGGT CACAGCGCTC TGGTACCGCC GTCCCCAAGC ACCAGTGGTG
GAAGTGCCAG GGATCGACCC ATCTCATAAT GCGGCGTTGG CGGAAGAATG CCAACACTTG
GTTCGTGATC TATGGGAGAC ATTGGCGTGC TTGATGGTGC CGGCATCGTA CTGGGTTATC
CAGCGAGCAC AACACAAGAT CTCCCAACTT CAGTTGGCCA CAGCCCTGGG GTTTGAACTC
CCGCCAACAA TGGTCACCAA TGACCCATCG GCACTGATCG CTTTCTCTCG CGCCCACAAT
GGGCAAATCA TTAGCAAACC GTGTATGGGA CTCGCCCTGC AACGGACGGG CTACTATCAA
TACACACGTC CCGTCACCCG CCGAGACCTT GCTGCAGCCG AGACCATTCA CGCATCGCCC
ATGATTTTCC AGAAACTGGT GCCTAAGGCG GTTGAGGTAC GCATCACCGT TGTCGGCGAT
CAGGTATTTG CCGTCGCGAT CCACTCCCAG GTCTCCCATC ATACCCGCTA TGACTGGCGG
CGCTACGACC ATGATCATAC GCCGCTTACA CCGCATACCC TGCCCCCCGC ACTGGCGGCC
CAGTGTGTGG CACTGGTGGC TCGCATGGGC CTGACCTACG GAGCCATTGA TATGATTCTT
ACGCCTAACG GACGATATAT CTTTTTGGAA ATCAATCCTA ATGGACAGTA TCTTTGGGTT
GAATCCCGCA CCGGCGTTCC TATCAGCGCG GCCATCGCCC GTCTGCTCCA GACCGGCACC
CACCGCACTG TCGCCAATCT AGTCGTGTCC GAGGGAGGAA TAGCATGCCT CCTGGAGTAA
 
Protein sequence
MILVLTSPDD THADRVCDLL DQADAPWFRF DPAAFPHSAQ LTVTTGATGL VQRLLVTRDH 
SLDLAQVTAL WYRRPQAPVV EVPGIDPSHN AALAEECQHL VRDLWETLAC LMVPASYWVI
QRAQHKISQL QLATALGFEL PPTMVTNDPS ALIAFSRAHN GQIISKPCMG LALQRTGYYQ
YTRPVTRRDL AAAETIHASP MIFQKLVPKA VEVRITVVGD QVFAVAIHSQ VSHHTRYDWR
RYDHDHTPLT PHTLPPALAA QCVALVARMG LTYGAIDMIL TPNGRYIFLE INPNGQYLWV
ESRTGVPISA AIARLLQTGT HRTVANLVVS EGGIACLLE
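
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_codons = "ATGATTCTTG TTCTCACCAG TCCAGATGAT"
print(translate(first_codons))
```

The first ten codons translate to MILVLTSPDD, the opening block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to check the remaining residues and the reported GC content.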