Gene Haur_1694 details


Gene Information

Locus tag: Haur_1694
Symbol:
ID: 5733578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1968472
End bp: 1969548
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 50%
IMG OID: 641278833
Product: permease
Protein accession: YP_001544465
Protein GI: 159898218
COG category: [R] General function prediction only
COG ID: [COG0701] Predicted permeases
TIGRFAM ID:
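
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name check_cds_lengths is hypothetical and not part of the database.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a single trailing stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1                 # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields for Haur_1694.
checks = check_cds_lengths(1968472, 1969548, 1077, 358)
for name, ok in checks.items():
    print(name, ok)
```

With the values listed for Haur_1694, the span is 1969548 - 1968472 + 1 = 1077 bp, which is 359 codons, or 358 residues once the stop codon is excluded.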


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000186998
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGTTG TCAGTATCGA TGTTAAACCC AAAAACCAAC GCAAATGGCT ACTCTTTTTC 
GCGCCAATCA TCGGCATTCT TGGGCTGTGG CTGGCGAGCG GCTGGCTTGT CCCAAGCAAT
TTAGCCCCCT TAACCAACAA ACTTCAAGGC TTAGTCACCA CCTTTCAAGG GATTTTTATT
GAGGCGCTGC CCTTTCTTAG TGCAGGGGTG ATTGTTTCGG TATTAATTGG CGAGTTCGTC
AAGCCGCAGC ATTTGGCCAG TTTTGTGCCC CAAAATGCCT TTGGAGCCTC AATTTTTGGC
TCGCTTTTGG GCTTGCTGTT TCCGGTCTGC GAGTGTGGGG CGATTCCAAC CAGTCGGCGG
TTGTTGCGCA AAGGCGCACC AGCCTCAATG GGAATTGCCT TTGCCTTAGC GGCCCCCGTG
GTCAACCCAA TTGTGCTGAT CTCAACCTCG ATTGCCTTTG GCGATGTGCG TTGGGCTTTG
GCGCGGGTCG GCTTTACAAT CATCATTGCC TTAACAATTG GCTTGATTAT TGGAGCTGGA
ATTAAACGCG AAGCAATTTT GACCCCACTT GCCCTAACCC CCGATGTTGA ACATGATCAT
AGCCATTGCG ACCATGATCA TGGTGCTTGC GACCATACCC ACGAACAACC CAAGGGTCGT
TTGGCAGGCC TGATTGCCCA CGGCAGCGTT GAATTTTTTG AGATGGCCCA GTATTTGGTG
ATGGGTTCGT TGTTGGCAGC GACTATGCAA ACCTTCATTC CCCAATCGGC CTTGCTCACT
TTAAATGATA GCGGCATCGG CTTTTTTGCT CCGTTGTTGG GGATTGTGGT ATTGATGTTG
GTGGCAGTGC TGCTTTCCGT GTGTTCCACG GTTGATGCTT TTTTGGCCTT ATCGTTCCTT
GGCTTGTTTC ATCCAGGTGC AGTCATGGCC TTTTTGGTCT TTGGCCCGAT GATTGATATT
AAAAGTACCT TGATGCTGAC CACGACATTC CGCCGCTCAG CAGTGATGGC AATGGTCGTG
CTAGCAGCCT TATTTGCAAT TATTGCTGGC TTGATCAGCT ATGTTGTTTT GATCTGA
 
Protein sequence
MAVVSIDVKP KNQRKWLLFF APIIGILGLW LASGWLVPSN LAPLTNKLQG LVTTFQGIFI 
EALPFLSAGV IVSVLIGEFV KPQHLASFVP QNAFGASIFG SLLGLLFPVC ECGAIPTSRR
LLRKGAPASM GIAFALAAPV VNPIVLISTS IAFGDVRWAL ARVGFTIIIA LTIGLIIGAG
IKREAILTPL ALTPDVEHDH SHCDHDHGAC DHTHEQPKGR LAGLIAHGSV EFFEMAQYLV
MGSLLAATMQ TFIPQSALLT LNDSGIGFFA PLLGIVVLML VAVLLSVCST VDAFLALSFL
GLFHPGAVMA FLVFGPMIDI KSTLMLTTTF RRSAVMAMVV LAALFAIIAG LISYVVLI
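
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand and uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names clean, translate and gc_percent are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCTGTTG TCAGTATCGA TGTTAAACCC"
print(translate(first_blocks))  # MAVVSIDVKP, the first block of the protein
```

Pasting the full gene sequence into translate should give a string that can be compared with the protein sequence after passing it through clean, and gc_percent on the same input can be compared with the GC content field.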