Gene Haur_0351 details


Gene Information

Locus tag: Haur_0351
Symbol:
ID: 5732261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 421892
End bp: 422866
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 50%
IMG OID: 641277474
Product: ABC transporter related
Protein accession: YP_001543130
Protein GI: 159896883
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1125] ABC-type proline/glycine betaine transport systems, ATPase components
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
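
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(421892, 422866)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (975 bp) and Protein Length (324 aa) fields.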


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0304446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCACAA TCGAGTTTCG CAATGTTCGT AAAATCTACC CCAAGGCCGC AACCGCCGCG 
CTCGCCGATA TTTCTCTGAC AATCAATCGC GGTGAATTTG TGATTTTGCT TGGGCCTTCG
GGCTGCGGCA AAACCACGCT CATGAAAATG ATCAATCGCT TGATCGAGCC AAGTAGCGGC
ACAATTTTGC TCGATGGAGT CGATATTCAG TCGATTAATG CCACGAAATT GCGCCAGCAA
ATTGGCTATG TGATTCAGCA AGTTGGGCTT TTTCCGCATA TGACGGTCGC TGAGAATATC
GCAGTTGTCC CCAAATTGCT TGGCTGGCCC AAAGCCAAAA TCCAAAGCCG CATCGACGAA
TTACTCCAGC TCATTCAGCT TGATCCTGCC CAATTTCGCC AGCGCTACCC GGCCCAAATT
TCCGGTGGGC AAGCCCAGCG CGTTGGTTTG GCCCGTGCTT TGGCCGCTGA TCCTGGGGTG
ATGTTGATGG ATGAACCATT TGGCGCGATC GATGCGATCA CCCGCACGGC CTTGCAAGAT
GAGATGCTGC GCATTCAGCA GCAATTGCAG AAAACTATCG TTTTTGTGAC TCACGATGTT
GAAGAAGCGC TTCGCTTAGC TGATAAAATT GCAATTCTGC ACGAAGGCAC GATCGTTCAA
TACGATACTC CGCTCAATTT ATTGCGTAAT CCCGCTAATC ACTATGTCGC TGAGTTACTG
GGAGCCGATG ATTTGGTGCG CCGTTTGAGT TTAATTCAGG TGCGCCATGT GCTTCAACCA
CTGCCTGCAA GCTATGATTC CTCGTTAGCA AGCATCGAAA GTAGCCGCAA CCTGCGCGAT
GGGCTGAATC AACTCCTCGC CAGCAGCGAC GAACAATTGT TGGTCGTTGA ACACAATCAA
CCAATTGGTA TGCTCTCATT AGCCGCAATT CATGCCTATT TGCACCCTGA GGTCATCCAT
GAACGACATC GTTGA
 
Protein sequence
MSTIEFRNVR KIYPKAATAA LADISLTINR GEFVILLGPS GCGKTTLMKM INRLIEPSSG 
TILLDGVDIQ SINATKLRQQ IGYVIQQVGL FPHMTVAENI AVVPKLLGWP KAKIQSRIDE
LLQLIQLDPA QFRQRYPAQI SGGQAQRVGL ARALAADPGV MLMDEPFGAI DAITRTALQD
EMLRIQQQLQ KTIVFVTHDV EEALRLADKI AILHEGTIVQ YDTPLNLLRN PANHYVAELL
GADDLVRRLS LIQVRHVLQP LPASYDSSLA SIESSRNLRD GLNQLLASSD EQLLVVEHNQ
PIGMLSLAAI HAYLHPEVIH ERHR
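
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of several alternative initiation codons and is read as methionine at the start position. The following is a minimal sketch of that translation in plain Python. The `translate` helper is hypothetical and not part of any database tooling; the codon table is built from the standard NCBI amino-acid ordering over the bases T, C, A, G.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11, in NCBI order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAGCACAA TCGAGTTTCG CAATGTTCGT"))  # MSTIEFRNVR
```

Applied to the first 30 bases, this reproduces the first ten residues of the protein sequence; the final five codons (GAA CGA CAT CGT TGA) likewise give ERHR followed by the stop.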