Gene Haur_0002 details


Gene Information

Locus tag: Haur_0002
Symbol:
ID: 5736836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1544
End bp: 3412
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 46%
IMG OID: 641277123
Product: hypothetical protein
Protein accession: YP_001542782
Protein GI: 159896535
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
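
The coordinate and length fields in this record are mutually consistent, which can be checked with simple arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 1544
end_bp = 3412

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # → 1869 622
```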


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000285538
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTTT TGGATCTACT TTTTGGGCGG CCACTGGCAA ATGAGGATGA GGAACATCAA 
CGAGTTGGTG TTGTAGCAGG GATTCCCATG TTAGGGTTAG ATGCGCTAGC CTCGGCAGCC
TATGGCCCTG AGGCAGCTTT AACGATCTTA CTACCATTGG GTTTGCTGGG CATTAATGCC
ATAACGCCGC TCGTTGCAAT TATCATCGTA TTACTGGGAA TCGTTTTTCT ATCCTATCGC
CAAACAATCA CAGCCTATCC AAATGGTGGC GGCTCCTATA CGGTTGCCCA TGAAAATTTA
GGAGTTATTC CTGGCCTCAT CGCCGCCGCC GCGCTTTTAC TCGATTATAT TCTTGTTGTA
GCAGTTGGTA TTTCGGCTGG CGTAGGTGCA CTCGTTTCGG CAATTCCAAA ACTACAACCC
TATATGCTCC CACTCTGTTT ATTAATTTTA GGGTTAATAA CGATTGTCAA CCTGCGAGGT
GTTCGTGAAT CAGGGCTAGC ATTTGTGATC CCTACCTATC TATTCATTGC TTGTATGTTG
ATTATCTTAG CCATGGGAGC TTATTTTGTA ATCATGAGTG GCGGTAAGCC AATCGCTAAA
ATTGCTCCTG CGCCACAACC ATCAACCATG ACAACCTTGA GCTGGTGGCT CCTAATTCAA
GCCTTTGCTA GTGGTTGTAC TGCCATGACC GGGGTTGAAG CGGTAAGTAA TGGGGTAAGT
GCCTTCCGTC AACCAGCGAC CCATTATGCT CGCCGCACCT TAACAATTAT CATTGGAACA
TTGATGGTGA TGCTTGCAGG CATTGCTTGG CTGGCTAAAT CGTATCAAAT TGGGGCGACC
GAACCAGGCA AGGCTGGCTA TCAGAGCGTT TTATCGCAAC TTGTCGCTGC CGTGAGCGGG
CAAGGCATCC TCTATACTCT AACAATTGGC TCTACTTTAG CAGTCCTCGC TTTGTCAGCC
AATACAGGGT TTGCCGATTT CCCACGGCTC TGTCGGATTC TTGCCCACGA TCATTTTCTA
CCCCATGCTT TTGCCTCACG CGGACGACGT TTAGTTTATA GCATCGGGAT TATAGTCCTA
GCGAGTTTTG CTGGGATTAT CTTGATCATC TTTGGGGGGA TTACTGACCA TTTGATTCCA
TTATTTGCGG TGGGAGCCTT TTTAGCATTC ACGCTTTCCC AAACCGGAAT GGTGCTGCAC
TGGTTTAAGC ATGGTGGCCT CAACGCCCGA CGCAATATGT TGATTAATGG AGTCGGTGCA
GTCTCAACCG GCATAACCTT GATCGTTATT TTAGTCGCAA AATTTGCGAC TGGTGCATGG
ATTACCTTAG TCATCTTGCC AGCATTAGTT GGTTTATTTC TCGCGGTACG ACGACACTAT
CAGCAAGTAG CCCAACAAGT ACAGCTGAAT ATTGCCTTAG ATACCAGCAA TTTAACCGCC
CCGATTGTGG TTATTCCGTT TGGTGGTTGG AATAAAATGG CCCATAAGGC ACTACGCTTT
GCATTAAAAA TATCACCTGA TATCTATGCT GTTCAGATAA GTACTGCTGA AGAAGCAGCA
ACGAAACGAG AACAATGGGA ACAAATTGTA CTTAAGCCAA TTCAGGAGGC AGGGTTGGCT
CAACCACATT TTGAATTAAT CGAATCGCCA TATCGGCAGT TGTTCGGGCC ATTAATGCGC
TTTATTCTTG ATTTACGTGA GGCCAACCCT AACCGCCAAA TTGCTGTAAT TATCTCAGAA
CTAGCTGAAA ATCGCTGGTA TTACTATTTA CTGCATAAAC AACGGGGAAT GGTTTTAAAA
GCCCGTTTGT TTTTTGGCGG TAATGCCCAA ATTATTGTGA TTAATGTTCC TTGGTATTTG
GAACATTAA
 
Protein sequence
MSVLDLLFGR PLANEDEEHQ RVGVVAGIPM LGLDALASAA YGPEAALTIL LPLGLLGINA 
ITPLVAIIIV LLGIVFLSYR QTITAYPNGG GSYTVAHENL GVIPGLIAAA ALLLDYILVV
AVGISAGVGA LVSAIPKLQP YMLPLCLLIL GLITIVNLRG VRESGLAFVI PTYLFIACML
IILAMGAYFV IMSGGKPIAK IAPAPQPSTM TTLSWWLLIQ AFASGCTAMT GVEAVSNGVS
AFRQPATHYA RRTLTIIIGT LMVMLAGIAW LAKSYQIGAT EPGKAGYQSV LSQLVAAVSG
QGILYTLTIG STLAVLALSA NTGFADFPRL CRILAHDHFL PHAFASRGRR LVYSIGIIVL
ASFAGIILII FGGITDHLIP LFAVGAFLAF TLSQTGMVLH WFKHGGLNAR RNMLINGVGA
VSTGITLIVI LVAKFATGAW ITLVILPALV GLFLAVRRHY QQVAQQVQLN IALDTSNLTA
PIVVIPFGGW NKMAHKALRF ALKISPDIYA VQISTAEEAA TKREQWEQIV LKPIQEAGLA
QPHFELIESP YRQLFGPLMR FILDLREANP NRQIAVIISE LAENRWYYYL LHKQRGMVLK
ARLFFGGNAQ IIVINVPWYL EH
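
As a consistency check between the gene and protein sequences listed here, the following Python sketch translates a nucleotide string codon by codon and computes its GC fraction. It is a minimal illustration, not a tool from the source database: the helper names `translate` and `gc_fraction` are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(dna):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(dna.split()).upper()

def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)

# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAGCGTTT TGGATCTACT TTTTGGGCGG"))  # → MSVLDLLFGR
print(translate("GAACATTAA"))                         # → EH
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.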