Gene Haur_3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3117 
Symbol 
ID5734989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3932678 
End bp3934960 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content49% 
IMG OID641280261 
Producthypothetical protein 
Protein accessionYP_001545883 
Protein GI159899636 
COG category[S] Function unknown 
COG ID[COG1300] Uncharacterized membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.247741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAACC AAGCCTTAAC CATTACACGC CGCGACTTAC GCGATACGCT CAGCGATTGG 
CGCACATTGA TTCCGTTATG CATTCTTTCG GTGTTGTTGC CGCTAATTCT GCGGGCGGGG
GTAGCGCAAG CAGTGAGTTT TCTCGACGAT GAATTTATCA ACCGCAGTTT AGTGCCATTA
GGTCTGTTGA TTGCGGGATT CTTGCCAGCC TCACTCTCCT TGATTGGCCC CTTGGAATCG
TTTGTCGGCG AACGTGAACG CTCAACCCTG GAATCGTTGT TGGCCATGCC AATGAGCGAT
CGTGGACTCT ATCTAGCCAA GTTTTTGGCG GCACTATTGC CACCAATTGG CTCCTCGGCA
GTGGCGATGC TGATGTATAG CCTAGCGATT GAGCTTGGGC GACCAGTACG CGTAGCCTTT
GTGCAAAAGC TGCTCAGCGA CTATTTAACC AGCGGCTGGA TTATTGGCAT CACATGTATT
TTGTTGATTA AAGTTTTTAC CATGGTTGCG GCAGCTGTCT ATGTTTCGTC ACATACTACC
AGCATTCGGG CAGCCAACTT ATTAGCCAGT TTCATCCTAA TTCCAATGGC AATTTTGGTC
CAGATCGAAG CACTGATCAT CATCAACGGC ATGTTTTTGC CAATTGTGTT GATTAATGGT
TTGTTGCTCG TGGTTGGTCT GACCATGGTT GGTTGGGGCA TGTATAGCTT TAATCGCGAA
GAATTGCTCT CACGTGAGCA TGAAAGCATC AGCAAACGAG CCTCAACCCA ACGCCTGAGC
CAATCGACCC GCACCTATGG CCCTGTGATG ACCATTGTAC AGCGCGAGGC TGTTGACACG
CTCAGCGACT GGCGAATTTT GGTGCCAATT GGCCTCTTGA CCTTTTTAGT GCCAATTGGC
GCTTTGTTTG GGGTCATTTA TGCTTTTGCT AATGTCGATG ACCCGACTGG GGTGGTTAAT
CTATTGCCCT TTATCGTGTT GCTGGTGGGC TTCTTACCAG CCTCATTTTC GCTGATTGTG
GCACTTGAAG TTTTTGTTGG TGAGCGCGAA CGTGGGTCGC TGGAGTCATT ATTCTCAATG
CCGATCAGCG ATGGTCAGTT GTATCGAGGA AAGTTATTGG CGGCGATCGT GCCACCAATT
GGCGCAAGCT TGGTTGGCAT GCTGTTGTTT GGGGTTGGAC TGAGCGTCTT TGCCCCAACT
GCCTTGCTGG GACGGATCAA CTTAAGCATT TTTAGCCAAA TGGTCTTGCT GAGTATTGCC
CAAGCCTTGA CGATGGTTTC AGCAGCGGTG GTGATTTCGT CGCATACCAA CACCGTGCGC
TTAGCCAATT TATTGGCAAG CTTTATTTTG ATCCCAGTGG CGATTATGGT ACAGCTTGAA
GCGGTGCTGA TTATCGGCGA ACGCTTCGAC GTGCTGAATG CGATAATGGC AGTAATGTTA
ATTTTGACAA TCGTCTTGAC GCGTACCGGC ATTGGTAGCT TTAAGCGTGA ATCAATTCTA
TCGCGTGAGC ATTTAGCCCT CAATTTTAGT CAAATCAATC GAGCCTTCAA AGCCTTTTTC
AGCGAAATTC GCCCAGCAGG CAGCAACCCC GATAGCCATC TCGGCTTATT CAATGTTGAA
ACTACTAATG GCCCACAACG CAATCTAGGA CTGTGGCTCA AACGCTTCTA TCGCCAAGAA
TTAGCTGTCG TTTGGCGCGA GACCCGACTA GCGCTGGTTG TGGTGCTCCT CTTTTGTGGC
GCAGCAATCC TTTTTGGTAG CCAATTTAAT CCAGTCAGTA GCCGCGAACA ATCATTGGCC
AATGTTATGA CGCGCTTAGA CGTAGGTCAG GGCACACTCT GGGAGCCGCC ATTAAGCTTT
ATTGTCATTA GCAATAGCTT TGCAATCTTT TTTGCTGGGC TATTTTCAAG CATCACTTTA
GGCTTTTTTG GCTTGATTCT GCCAGCCATT AACCTAATGG GATTGAGTTT TCGAACCAGT
AGTTTAGCCG CAAGCGGCGG CCTAGCAACT GCTCTCAATT ATCTAGTGGG CTATGAATTG
CCGCATGGGT TATTGGAAAT TCCGCTAAGT ATATTTGCGG CAGCCTTGGC ATTACGTATG
GGTGCGGCTT TGGCGTTTGT GCCACCAACC TATAGCGCTG GGCGGCATTT GCTCTGGGCT
TGGGCCATGT ATCTCAAAGT ATTTTGCTTG CTAATTGTGC CTGGACTCGT GCTGGCGGGC
TTGATTGAAG TGTTGGTAAC CCCAGCTGTG CATCAGATGG TCTATGGGTT TCTGTGGATT
TAG
 
Protein sequence
MRNQALTITR RDLRDTLSDW RTLIPLCILS VLLPLILRAG VAQAVSFLDD EFINRSLVPL 
GLLIAGFLPA SLSLIGPLES FVGERERSTL ESLLAMPMSD RGLYLAKFLA ALLPPIGSSA
VAMLMYSLAI ELGRPVRVAF VQKLLSDYLT SGWIIGITCI LLIKVFTMVA AAVYVSSHTT
SIRAANLLAS FILIPMAILV QIEALIIING MFLPIVLING LLLVVGLTMV GWGMYSFNRE
ELLSREHESI SKRASTQRLS QSTRTYGPVM TIVQREAVDT LSDWRILVPI GLLTFLVPIG
ALFGVIYAFA NVDDPTGVVN LLPFIVLLVG FLPASFSLIV ALEVFVGERE RGSLESLFSM
PISDGQLYRG KLLAAIVPPI GASLVGMLLF GVGLSVFAPT ALLGRINLSI FSQMVLLSIA
QALTMVSAAV VISSHTNTVR LANLLASFIL IPVAIMVQLE AVLIIGERFD VLNAIMAVML
ILTIVLTRTG IGSFKRESIL SREHLALNFS QINRAFKAFF SEIRPAGSNP DSHLGLFNVE
TTNGPQRNLG LWLKRFYRQE LAVVWRETRL ALVVVLLFCG AAILFGSQFN PVSSREQSLA
NVMTRLDVGQ GTLWEPPLSF IVISNSFAIF FAGLFSSITL GFFGLILPAI NLMGLSFRTS
SLAASGGLAT ALNYLVGYEL PHGLLEIPLS IFAAALALRM GAALAFVPPT YSAGRHLLWA
WAMYLKVFCL LIVPGLVLAG LIEVLVTPAV HQMVYGFLWI