Gene Haur_1675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1675 
SymbollysS 
ID5733559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1940893 
End bp1942371 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content51% 
IMG OID641278814 
Productlysyl-tRNA synthetase 
Protein accessionYP_001544446 
Protein GI159898199 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.100386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAA ACGATTTACA GCAAACACGC TATGGCAAGC TACAGGCGCT GCAAGCCGCT 
GGCATCGAGC CATATCCAGC CCGAGTGCCG CAACGCACTC ATACATTAAC CGCCGTGCGT
GAGCAATTTT CGGCCCTGGT TGAGGCCAAT GCCACGGTAA CAATTATGGG GCGCTTGCGC
CAACGTCGCG TTATGGGCAA ATCAGCGTTC GCCCATTTAA ATGATGATCA TGGCGCGTTT
CAAATTTTCC TCAGCAAAGC CGATGTTGGC GATGAGCCAT TCAAGCATTT TGTTGATCTG
ACTGATCTTG GCGATATTAT TGCGGTCACA GGCACGCTCT TTACGACCAA AATGGGCGAA
CCAAGCGTAC ATGTCACCAG CTGGACGATG CTCAGCAAGG CGATCACGCC GCCACCCGAC
AAACGCGAAG GTCAATTTAG CGACCAAGAA GCTCGCCAAC GCCAACGCTA TGTTGACTTA
TCCGCCAATC CTGAAGTTCG CGAAATCTTC CGGATTCGCG CTCGTTTGAT CACGGCAATG
CGGCGCTACC TCGATGAACG CGGCTTTTTG GAAGTTGAAA CGCCAGTATT GCAGGGGATT
TATGGTGGCG CAGCGGCGCG ACCATTCACC ACCCATCATA ATCAATTGCA CCAAGATTTA
TACCTGCGGA TCGCCACCGA GCTCTATTTG AAGCGCTTGA TCGTTGGCGG CTTCGATGGT
GTGTATGAAA TTGGCAAAAA CTTCCGCAAC GAAGGCGTTG ATCGCACCCA TAACCCCGAA
TTTACCATGA TCGAGGTCTA TCAAGCCTAC GGCGATTATG AATCGATTAT GCAATTAACC
GAGGGCATGA TTCGCTTCGC TGCTGAGCAA ATTTTTAACA GCACCAGCAT CGAATACCAA
GGGCATCAGA TCGAGCTTGG CGGTTCGTGG CAGCGCTTGA CCATGCGCGA TGCCATTTTT
GAAAAAACCG GGGTTGATAT TCGCGAGTGC CGCGAATTTG ATACACTATG GGAAGCAATT
GGCGAAGCTG GCCTGAAAAT TGAGCGCAAG CCAACCTGGG CCAAGCAAGT TGATGAGCTA
TTTAGTGAGT TTGTTGAGCC TGAGTTGATT CAGCCAACCT TTATCACCGA ATACCCTCAG
CCACTTTCGC CTTTGGCCAA GCGCAAAGCC GATGATCCAC AGTTTGTCGA GCGCTTTGAG
CTATTTATGC TTGGAGCCGA AATTGCCAAC GCCTTCAGCG AATTAAACGA TCCCTTCGAT
CAAGAGCAAC GCTTCTTGGA GCAAGGCCGC GATTATGCTG CTGGCGATGA CGAAGCCATG
CAAATGGACG AAGATTACCT TGAGGCGCTT AAAGTTGGTA TGCCACCAAC TGGCGGTTTA
GGCATCGGGA TCGATCGGCT ATGTCTGTTA TTTACCAATC AAACTACGAT TCGTGAAGTA
ATCTTCTTCC CGCATTTGCG CAAGCAGGGC GAGGAGTAG
 
Protein sequence
MELNDLQQTR YGKLQALQAA GIEPYPARVP QRTHTLTAVR EQFSALVEAN ATVTIMGRLR 
QRRVMGKSAF AHLNDDHGAF QIFLSKADVG DEPFKHFVDL TDLGDIIAVT GTLFTTKMGE
PSVHVTSWTM LSKAITPPPD KREGQFSDQE ARQRQRYVDL SANPEVREIF RIRARLITAM
RRYLDERGFL EVETPVLQGI YGGAAARPFT THHNQLHQDL YLRIATELYL KRLIVGGFDG
VYEIGKNFRN EGVDRTHNPE FTMIEVYQAY GDYESIMQLT EGMIRFAAEQ IFNSTSIEYQ
GHQIELGGSW QRLTMRDAIF EKTGVDIREC REFDTLWEAI GEAGLKIERK PTWAKQVDEL
FSEFVEPELI QPTFITEYPQ PLSPLAKRKA DDPQFVERFE LFMLGAEIAN AFSELNDPFD
QEQRFLEQGR DYAAGDDEAM QMDEDYLEAL KVGMPPTGGL GIGIDRLCLL FTNQTTIREV
IFFPHLRKQG EE