Gene Haur_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1707 
Symbol 
ID5733594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1985838 
End bp1988378 
Gene Length2541 bp 
Protein Length846 aa 
Translation table11 
GC content53% 
IMG OID641278849 
ProductTPR repeat-containing protein 
Protein accessionYP_001544478 
Protein GI159898231 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTT TCGATGCCGA TTCGACCTAT CTCGCAGTGC TGTTGGTGTT TGAGCCATTT 
GATTCTGAAT TATGGCCGCT ACTGGCTCCG GCTGCGCCAA GCGTCGAGCA ACTTCAAGCT
GCTGGTTGGT TGCAAGCGAG CGCCGCTGGA TTAAGTTTGC TGACAGAGCG CCGCGAACAG
CTGCAAGCCC AGTTCGATTC GGCCAGCATT CGCGCAGCCT ATCAGCAATT AATCGCTGCT
GAACTGAGAT TAAGCGACAA TCAACCAGCG TTTGCCGCCC GTTTATTTGC CGATCTGCGA
ACGTTTGCCG AGTTGCTGAT TCGCCAAGCG CCACATGAAC TGGCCGATCT GGTCAATCAA
ATTCCAGTTG ATTTAGCGCC AACCAAGGCC GATCGCCAAC TGTTGGAGTA CTACCGTGGC
CTCGCCGCTG GCCTGCGTGA TCAATATGCA GAGGCATGTA CGTTGCTCAG CCAATTATTG
GCGCAACCAG ATTTGGAGCC AATGATTCGC GGGCGGGCGC TCAACTCCGA TGCCACGTTT
GCCCGCTACA GTGGCAATTA TGATCGGGCG TTGGCCAATT ATCAAGCCAG TTTTGCGCTG
TGGGAAGCCC AGGCTGATCC AGTTCGTCAA GCCTATGTGC TGCTGAATGA AGGTAGTTTG
CGCTACCACC TGCAAGAATA TCGCGCTGCT GAACGCTGCT TGAATAGCAG TTTAGCGACC
TTACAAGCCC AAAATTTATT GTATCCCCAA GCTTTGGTGC TGATCAACCT TGGTTTGTTG
GCGCGTGATC GTGGCAATTG GGCGCAGGCC TTAAGCCATT TTCGGCAGGC TGAAGCTATT
TTGCAAGCTG AAAATGCCAC CGATTTTCTG GGGCGCATCG CCAATAATTT GGGCGAATTA
GCCTTGTTGC AGGGCCAATA TCAAGCTGCT CGTGAGCATT TTGAGCAAGC CCTAGCCCAG
ATGAGCAGTC GAGTTTATCA CATTGATGCC TACTTGAATT ATGGTTTGGC CTGGCATGTT
GAAGGCATGT TTGAGCAAGC TGAAACCGCC TATCGCCAAG CGCTCGATCT GGTGGAAAGC
GTTGAGCGCC AAGAAATCGC CGCTTTAGTT TGGTTTCGTT TGGGCCAAGT TGCAGCGGCT
CGCAATGATC ATCACCAAGC CGAGCAGCAT TATTTGCAAG CAATTGAGCT GATCGAGGCG
ATGCGTGCAC CCATTTTGGC CGAAAGCTTG CAAATTAGTT TGATGGGGCG GTGGCAACAG
GTCTACGAAG GGGCGGTGGC GGCCTATCTC GCTCAATCCA ATGTTGAAGC TGCCTTTGTG
ATGGCGGAAA AAGCCCGTGC TCGCGCACTC AACGATTTGC TGGCCCGCAA CGGCCAAACC
AACCAAGCAA TTGGCACAAT TCCCAGTTTG AGTGAGTTGC AACAAAGCTT GGCGCAGGGC
AGCCTTATGC TCGATTATAT GACAATTGGT GCGGTTGGGC CTGAGGCCAG TTTGTTGGCG
GCTTTGCCTG CGAGTGCCAA AGCCTTGCGC AGCTTGTTAA TCCAGCCGGC AGCAACTTGG
CTATTTGCAA TTACCGCTGA GCAAGCTCAA GCCTTCAATT GCCAAATCGA CCCCAATATT
CTCTTGGCGA CCTCACCATT TCAGTGTGAT GGACGACGCT TTTTGCGTCC GGCAATTTTG
CAGCGCTTGC AGCAACGTTT GCTTTTGCCA GCTCAAGCTT ACTTACAACA GGCGCAACAG
GTGATTATCG TGCCGCATGG AGCTTTGCAT CATGTGCCGT GGAATGCCCT GTTGCTGGGC
GAACTTCAGC TTGATCTGCC TTCGACTACC ATTCCAAGCG CTGCTAGCTA TTTGCAATTG
AGCCAACGCC CACCGAGCCA AGCTCCCGAG GCTTGTGGGG CATTAAGTTA CGCTGGTGGG
GTTGAACCAG CCTTGGTGCA TACGCATGCC GAGGCCGAAG CAGCGGTGCA GGCACTTGGC
GGCCAGCATT ATCCATTGCC AGTGCCCAAT ATTCAACAAG CGCTTGGCAA TTATCGCATT
GTGCATATTG CCTGCCATGG CGTGTTTGTG CTCGACCAAC CTTTGGCTTC ATGGCTGCAA
TTTGGGCCTG AGCAAACCGT CTCGGCTTTG GAGATATTAA CAACTTGGCA ATTAGCCGCT
GATTTGGTGG TGTTGAGTGC TTGCCAAAGT GGTGTGAGCG AAATTGTGCG GGGCGACGAG
CCATTTGGTT TGGTGCGGGC ATTTTTGGCA GTTGGCGCAC GCGCAGTGTT GGTCACACTA
TGGCCAGTTG ATGATGTGGC CAGTGCGGTG TTGATGAAGC TATTTTATCA AGCCCTGCAA
AGCGGTGCTG CCCCCGCCGA GGCCTTACGC CAAGCAGTGC AACAGATTCG CAGCATGCCT
CAAACACAGG TAGCTATGCC GCTTGCTCCA AGCCAGCAGA CAGAGTACCC GTTCGCCGAT
CCGCATTATT GGGCGGGCTA TCAGTTAATT GGCGTTGGCA GCTCGATTTC TACTGTCAAG
CCAGCCACAT CAGCCGCTTG A
 
Protein sequence
MTAFDADSTY LAVLLVFEPF DSELWPLLAP AAPSVEQLQA AGWLQASAAG LSLLTERREQ 
LQAQFDSASI RAAYQQLIAA ELRLSDNQPA FAARLFADLR TFAELLIRQA PHELADLVNQ
IPVDLAPTKA DRQLLEYYRG LAAGLRDQYA EACTLLSQLL AQPDLEPMIR GRALNSDATF
ARYSGNYDRA LANYQASFAL WEAQADPVRQ AYVLLNEGSL RYHLQEYRAA ERCLNSSLAT
LQAQNLLYPQ ALVLINLGLL ARDRGNWAQA LSHFRQAEAI LQAENATDFL GRIANNLGEL
ALLQGQYQAA REHFEQALAQ MSSRVYHIDA YLNYGLAWHV EGMFEQAETA YRQALDLVES
VERQEIAALV WFRLGQVAAA RNDHHQAEQH YLQAIELIEA MRAPILAESL QISLMGRWQQ
VYEGAVAAYL AQSNVEAAFV MAEKARARAL NDLLARNGQT NQAIGTIPSL SELQQSLAQG
SLMLDYMTIG AVGPEASLLA ALPASAKALR SLLIQPAATW LFAITAEQAQ AFNCQIDPNI
LLATSPFQCD GRRFLRPAIL QRLQQRLLLP AQAYLQQAQQ VIIVPHGALH HVPWNALLLG
ELQLDLPSTT IPSAASYLQL SQRPPSQAPE ACGALSYAGG VEPALVHTHA EAEAAVQALG
GQHYPLPVPN IQQALGNYRI VHIACHGVFV LDQPLASWLQ FGPEQTVSAL EILTTWQLAA
DLVVLSACQS GVSEIVRGDE PFGLVRAFLA VGARAVLVTL WPVDDVASAV LMKLFYQALQ
SGAAPAEALR QAVQQIRSMP QTQVAMPLAP SQQTEYPFAD PHYWAGYQLI GVGSSISTVK
PATSAA