Gene Hneap_1511 details


Gene Information

Locus tag: Hneap_1511
Symbol:
ID: 8534669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1643884
End bp: 1645074
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 59%
IMG OID: 646383901
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_003263389
Protein GI: 261856106
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
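
The coordinate and length fields above are consistent with one another, which can be checked with a few lines of Python. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1643884, 1645074)
print(length, protein_length(length))  # 1191 396
```

With the listed Start bp and End bp, this gives 1191 bp and 396 aa, matching the Gene Length and Protein Length fields.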


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00343011
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGAG TTGAAGACCC CCAATGGCAA CCCCAAACCC GGGCGATCCG GGTTGGCCAT 
CACCCAACGA ATGAAGGTGA ACACGGCGAG CCGATCTTCA CCACATCAAG CTTTCAGTTC
GAATCGGCCG AGCAGGCTGC GGCCCGTTTT TCCGGCGCCG AACCCGGCAA TATCTATGCA
CGGTTTACCA ACCCGACGAC CAAGGTGTTC GAAGATCGGT TGGCGGCCCT TGAAGGGGGC
GAATCCTGCG TGGCGACCGG TTCCGGCATG GCCGCCATTC TCAGTACCTT CATGGCCTTG
TGCTCGGCGG GCGATGAAGT GGTGGTGGCC CGACAGGTGT TCGGCACCAC TTCCGTGCTG
TTCAACAAAT ACCTCGCGAA ATTCGGCTTG AAGGTCAAAT GGGTCGATTT GACCGACTGG
TCGCAGTGGG AAGCCGCCAT CACGGACCTG ACCCGCTGGG TGTTCGTGGA AAGCCCATCC
AATCCGTTGA CCGAAGTGGT CGATATCGCT CGTCTGGCAG AGTTGGCGCA TAAGCATGGT
GCCGGTCTGA TCGTGGATAA TTGCTTCTGC ACGCCCATAC TCCAGCAGCC GTTGGCATTG
GGTGCGGACA TCGTCATCCA TTCCGCCACG AAGTTTCTCG ATGGGCAAGG CCGGGCCATA
GGTGGCGCTG TCGTCGGCAA TAAGAAATTA GTGGGTGAAG AGGTGCGCGG TTTTCTGCGG
ACCTGCGGCC CCACTATGTC ACCGTTCAAT GCTTGGATTT TTGCAAAAGG CTTGGAGACC
TTGGCCCTGC GCATGAAAGC GCACTGCGCC CACGCCAGCG CCGTCGCGGA TTTCCTGGCG
GCTCACCCTC AGGTCAAACG CGTCTATTTC CCCGGGCTGT CCAACCACCC GCAAGCGGAC
ATCATCGCCA GACAACAGTC AGGCCCGGGC GCGATCGTGT CCTTCGAGGT CGAAGGCGGG
CAGGCGGCGG CATGGCGGGT AATCAATGCC ACGCAAATGA TTTCCATCAC AGCGAATCTG
GGTGATGCCA AAACGACCAT CACCCATCCG GCCACCACCA CGCACGGACG TTTGACGCCC
GAGCAGCGTA AAGAATCGGG TATTCATGAT GGGCTCGTAC GTCTGGCTAT TGGCCTTGAA
GATCCCATCG ACATCATCCG GGATCTCAAG CGAGGCCTTG ATCGTGAATG A
 
Protein sequence
MNRVEDPQWQ PQTRAIRVGH HPTNEGEHGE PIFTTSSFQF ESAEQAAARF SGAEPGNIYA 
RFTNPTTKVF EDRLAALEGG ESCVATGSGM AAILSTFMAL CSAGDEVVVA RQVFGTTSVL
FNKYLAKFGL KVKWVDLTDW SQWEAAITDL TRWVFVESPS NPLTEVVDIA RLAELAHKHG
AGLIVDNCFC TPILQQPLAL GADIVIHSAT KFLDGQGRAI GGAVVGNKKL VGEEVRGFLR
TCGPTMSPFN AWIFAKGLET LALRMKAHCA HASAVADFLA AHPQVKRVYF PGLSNHPQAD
IIARQQSGPG AIVSFEVEGG QAAAWRVINA TQMISITANL GDAKTTITHP ATTTHGRLTP
EQRKESGIHD GLVRLAIGLE DPIDIIRDLK RGLDRE
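
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first three 10-base blocks of the gene. This is a minimal sketch, not the pipeline that produced this record: it assumes the sequence is read in frame from its first base, and it uses the standard codon assignments, which translation table 11 shares for amino acid codons (table 11 differs in which codons may initiate translation). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    # Step through complete codons only; a trailing partial codon is dropped.
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAACCGAG TTGAAGACCC CCAATGGCAA"))  # MNRVEDPQWQ
```

The output matches the first ten residues of the protein sequence. A stop codon is rendered as `*`, so translating the full 1191 bp sequence this way would end in `*` after the final residue.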