Gene Haur_1226 details


Gene Information

Locus tag: Haur_1226
Symbol:
ID: 5733119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1418710
End bp: 1419783
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 52%
IMG OID: 641278366
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001544002
Protein GI: 159897755
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
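
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1418710
end_bp = 1419783
gene_length_bp = 1074
protein_length_aa = 357


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under those assumptions, 1419783 - 1418710 + 1 gives 1074 bp, and 1074 / 3 - 1 gives 357 aa, so the reported fields agree with each other.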


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAAG CACCCTTGCT TGAGGTCAAA AACCTACAAG TTCAATTTAA AACTGCTGAT 
GGCGTGGTCA ATGCTGTCAA TAATGTTTCA TTCTCGGTCA ATCGCGGCGA AACCTTAGGC
ATCGTCGGCG AATCTGGCTC AGGCAAAAGC GTGACATCAC TTTCGATTAT GCGCTTGATT
CCCTCGCCCC CAGGCAAAAT TGCTGGTGGC CAGATTTTAT TTGATGGTGA TAATCTTGTC
GATTTTAGCG AGTCGGAGAT GCGTAAAATC CGTGGCAATC GGATTTCGAT GATCTTCCAA
GACCCCATGA CTTCGCTCAA TCCGGTGCTA CGGATTGGTC GGCAGATGAC TGAATCGCTG
CAATTGCACA TGGGGATGAC TCCCAAACAG GCGCGAAACC GAGCCATTGA CTTGCTCTCA
ATGGTTGGGA TTCCAGCTCC TGACAAACGG CTTGATGATT TTCCCCATCA ATTTTCTGGC
GGGATGCGCC AACGGGTGAT GATCGCTATG GGTTTGGCTT GTAACCCTGA GCTATTGATC
GCCGACGAGC CAACGACAGC ACTCGATGTA ACTATTCAAG CGCAAATTCT CGAATTGCTT
AACCGTCTGA AGAACGAGAC AGGCACGGCG ATTATTTTTA TCACCCACGA CCTTGGCGTT
GTGGCGGGCA TGACCGATCG GGTGATTGTG ATGTATGCTG GACGGGTGGT CGAACAGGCC
TCAACCAACG AGCTGTTCCA TAATCCCCGT ATGCCTTACA CCATCGGTTT GCTCGATTCG
ATTCCCCGGC TTGATGGAAT CCAAACGCGC CTTACGCCAA TCCCAGGGCT ACCACCGGAT
TTGCTGGAGA AAACCGAGCG CTGCCCATTT GCACCGCGCT GCGATTTTGT GCAAGAGCAA
TGTTGGAGCG AAACGCCGAG TTTGCGCCAA GTTGCGCCTG AGCATACCGC TGCCTGTTTA
TTCGAGATAG ATCGGGAACA GCGCCAAGCG ATGGCCGCCA AGAAGATTGC CGAAGAACAA
GCCGCCTTGG ATGCTGCGCT TGAAGATGTT TTAGCCCACG AACAGGCATC GTAG
 
Protein sequence
MAEAPLLEVK NLQVQFKTAD GVVNAVNNVS FSVNRGETLG IVGESGSGKS VTSLSIMRLI 
PSPPGKIAGG QILFDGDNLV DFSESEMRKI RGNRISMIFQ DPMTSLNPVL RIGRQMTESL
QLHMGMTPKQ ARNRAIDLLS MVGIPAPDKR LDDFPHQFSG GMRQRVMIAM GLACNPELLI
ADEPTTALDV TIQAQILELL NRLKNETGTA IIFITHDLGV VAGMTDRVIV MYAGRVVEQA
STNELFHNPR MPYTIGLLDS IPRLDGIQTR LTPIPGLPPD LLEKTERCPF APRCDFVQEQ
CWSETPSLRQ VAPEHTAACL FEIDREQRQA MAAKKIAEEQ AALDAALEDV LAHEQAS
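
The gene and protein sequences above are printed in blocks of ten residues, so they need cleaning before any computation. The following is a minimal sketch, not part of the original record, that strips the whitespace, computes a GC fraction, and translates DNA to protein. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCTGAAG CACCCTTGCT TGAGGTCAAA AACCTACAAG TTCAATTTAA AACTGCTGAT"
print(translate(clean_sequence(first_line)))  # MAEAPLLEVKNLQVQFKTAD
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through the same helpers would allow the reported protein and GC content to be compared against the record in the same way.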