Gene Haur_2393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2393 
Symbol 
ID5734274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3049579 
End bp3050622 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content53% 
IMG OID641279534 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001545161 
Protein GI159898914 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAAA CTGTGTTAAC TGCTGAGCAG TGGAAATTAG CTGAATCTCC CGACCCGGTG 
GTCTTGGAAG TGCGCAATTT GACCAAACGC TTTCCAGTTG GTGGTCTGTT TCGCAGCAAA
CAGGTCCATG CCTTAACCGA TGTGTCATTT GCAATTCGCC GTGGTGAGGT GTTGGCGGTC
GTGGGCGAAT CAGGCAGCGG CAAAAGCACC GCCGCTCGGT TGATCGCTCG CTTGATGGAG
CCAACCAGTG GCGAGATTAT CTTCCGTGGC CAAAATGTGC TGCAAACTGA GAAACGTGGT
GCTTCGCTGA GCTATCGCAG CGGCGTACAA ATGATTTTTC AAGATCCATT TGGCTCGATG
AACCCAACCC ACTCGGTGGC GCATCACATT ATGCGACCAT TGCAAATTCA TCATAAAGTT
GAGCGACGCA GCGATTTGTT GCCACGAGTG CATGAGTTGC TGGCGACGGT CGGCCTGAAT
CCACCAGCCG ATATTGCCAA TAAATACCCA CACGAGTTAT CTGGTGGGCA ACGTCAACGG
GTGGCGATCG CCCGCGCCTT GGCCGTTGAT CCGGAAATCG TGCTGGCCGA CGAACCAATT
TCGATGCTCG ATGTTTCGAT TCGGATTGGC GTTTTGAATT TGATGGCCAA GCTCAAAAAA
GAGATTGGCA TCGGCTACCT CTACATTACC CACGATATTG CCAGCGCCCG CTATTTTGCC
GACCGGATTA TGGTGTTGTA TGCAGGCCAA ATGATGGAAG GTGCTGATAG TGACGAGTTG
ATCGGCAACC CCGCCCATCC CTATACAAAA TTGTTGCTTT CGGCTGTGCC AAACCCCGAA
GTGGCGCTTG GTCAGCGTGA AGTTGTCGCC CGTGGTGAGC CGCCTTCCTT GATCGATCCG
CCGCCTGGTT GCCCATTTGC GGCGCGTTGC CCTCAAGTCA AGGATGTTTG TCGTAAAGTA
ATGCCCGATG TGCAGACGAT TGCGCCAAAT CACTGGGTTC GTTGCCATTT GTATGGTGAG
GGCACTGGAG GAACTGCGGC ATGA
 
Protein sequence
MTKTVLTAEQ WKLAESPDPV VLEVRNLTKR FPVGGLFRSK QVHALTDVSF AIRRGEVLAV 
VGESGSGKST AARLIARLME PTSGEIIFRG QNVLQTEKRG ASLSYRSGVQ MIFQDPFGSM
NPTHSVAHHI MRPLQIHHKV ERRSDLLPRV HELLATVGLN PPADIANKYP HELSGGQRQR
VAIARALAVD PEIVLADEPI SMLDVSIRIG VLNLMAKLKK EIGIGYLYIT HDIASARYFA
DRIMVLYAGQ MMEGADSDEL IGNPAHPYTK LLLSAVPNPE VALGQREVVA RGEPPSLIDP
PPGCPFAARC PQVKDVCRKV MPDVQTIAPN HWVRCHLYGE GTGGTAA