Gene Haur_1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1928 
Symbol 
ID5733817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2332527 
End bp2335145 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content53% 
IMG OID641279072 
ProductFG-GAP repeat-containing protein 
Protein accessionYP_001544699 
Protein GI159898452 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATTT CGCTCAAAAT ACTTAGCAGC TTGCTCATGG CAGCGCTGTT GATAACGATC 
ATGCAGCATC CCAAAGCTCA AGCGGTTCCC GCCGCCACGA TTGTGTTTGA TCCGGTGGTA
ACCTATCCCA CAACCGGCAG CGGTGCATTT GTGGCAACTG GCGATTTCAA TCGCGATGGC
GTGGCCGATC TCGCGGCAAC CTCACGCGCC ACCAATGAAA TTAATGTGTT GCTTGGCAAT
AGTGATGGCA CATTTGCAGC GACGGTAGCC TATGCAACTG GCAATTTTCC AATCTCAGTC
GCCGTCAGTG ATGTCAACCA CGATGGCAAC GATGATTTGG CAGTAGTTAA TTTTGGCTCA
GCTTCAGTCT CGCTGTTGCT TGGCACAGGC ACAGGCAGTT TCAACGCGGC AGTTAATCTG
GCAACGGTTG GCGCACCCGA ATCAATTCAA ATCGCCGATC TGAATCGTGA TGGTCACCCC
GACCTAGTTG TTGGCGGATT AACCGCAACC AATAATGTTG GGGTTGCGCT GGGCAATGGT
AGCGGCGGGT TCGCCAGCCC TTTGCTGAGC ACAGTTGGCG GCGGGGGCTA TGGCCTAACC
GTGGCCGATT TTAACCACGA CAATCGTATG GATGTGGCCA AAACGATCTA TACCGACAAA
GCTTTTACCG TTGGTTTCGG CAATGGCGCA GGCGGATTCA CTGGCGCGGT CAATTATCCG
CTGAGCGATC TTCCCTATGG TATTGTTCCT GGCGATTGGA ATCACGATGG CAACCGTGAT
CTAGCAATCG TCATCCGACC GCCAACCAAT CAGGTCGCGG TGGTATTTGG CAGTGCTAAC
GGTAGTTTCG GCGCACCAAC CTACTATCCA GTTGGCCCAA CCCCCGAATG GCTCACTGCC
AGCGATCTCG ACAACGATGG CAACCAAGAC CTAGCGGTGG CAACGATCAG CGGCAATGTG
TGGGTCTTGC AAGGCAGCAG CAACGGCATA TTTCATGATG CAGGCTCATT TAGCGGAGCC
TTTGTGCCAC GGGCGGTTGC CGCTGCCGAT TTCAACCGCG ATGGCCGCAA CGATTTAACA
ACCAGCAACG AAGGCGCAAT TGTTGGGGTT TTGCTGAACC GCAGCGATAG CCAATGTGGT
GGCATTGGCT TTAGCGCAAC AACCACAGCG ATTGGAGCCG AAACCTTGGT TGGCAGCACC
ACCAGCGATA TGAATAATGA TGGTGATCTC GATTTGGTAA TCGCCAACCC AAGCACCAAC
AGCATCATTA TTCGCTATGA TAATGGCACT GGATCGTTTA GCAGCGCCAT AAGTATCAGC
CTTGGGCTGC AACCATCCGA TGTAGCGGTT GGAGATGTCA ATCGCGATGG CCGCAACGAT
TTGATTGTGG CCGCCTTTTT GGCCAATCAA GCGCTTGTGC TGCTGAATAA TGGAGCTGGA
AATTTTACCC CCACCAGCTA TCCAACGCCC TTTAATCCGG TTGAAATCAC GATTGCCGAT
TTCAATAACG ATGGCAGCCC CGACTTTGCC AGCGCCAACT ACAACGCCAA TTCGATCAGC
GTGCGCATGG GCAATGGCAG CGGTGCTTTT GGCGCAGCAA CTGATTATGC AAGCGATGCG
CATCCCTTCA AACTCACCAG CATTGACCTT GATCGCGATG GCGCGATCGA TCTGGCGTGG
GTTAATTACG TGGCAAATAC CCTGAGCGTT CGCTTGAACA ATGGCAATGG CACGTTTGGC
TCAACCATCA ACACAGCGGT TGGGGTTGGC CCAACCTCGG TGAACTTTGG CGATTGGAAT
CGTGATGGCA TTGCCGATGC AGCGGTCACC AATCGTAGTG CTCACACACT TTCGATTCTG
CGCGGCACAG GCACAGGAGC ATTCAGCGTT ACCGCCACCA TGCCCAGTAA TCTGTTGCCG
ATGGATGTCT TGAGCCACGA TTTTAATCTT GATGGGCGGC TCGATTTAGC GGTATCCTAT
ACCTCAGAAT CAAGTGTCAC GATTGCGCGG GGCAATGGCG ATGGAACTTT CGCAGCACTC
AGCGCTTTTG GCCCACGGGT TGCCAACCGT GGTTTAAGTG CTGGCGACTT TAATCACGAT
GGCAAGATCG ATCTGGCGGG AGCCAATTAT CTCTCGAATA CCTTGGCCGT GCTGTTGAAT
AGTTGTCCGG CTGCGCCACC GCCGACCCCA ACCCCAACGA TAACTCCAAC GCCGAGCGCC
AGCAGTCAAC GATCGGCCTA CCTACCCCTA GTCTCGCTAC GTCAAATCAG CGTCTTGGCA
ACACTCAATG ATCAAGCGAT TCCAATTCGC CCGATCACTG TTCAAGGCGA AACCTACTAT
ACAACCACGA TTCAGCTTGT CACAAGCTTA CCGCCAGGCG GCAAATTCTA TTTTTCAGCA
TCACCACAGA GCGTACAACC AATCCAAGTT GACGATGAAC TGGTGGTGCG AATCAACGGC
CAAGTGCAAT TTAGCCAGAT CGCCACAACG CCAATGATCG TGGAAATTCC CCGCGCAACG
CTTGAAACGT GGGTAGGTCA ATCGCTCGAA ATTGCCTTTC GCGATGTCTA TGGCTCGCTG
GTTGGCAGTA GTCCCGTCTG GTTAATTTGG GTTCCTTAG
 
Protein sequence
MPISLKILSS LLMAALLITI MQHPKAQAVP AATIVFDPVV TYPTTGSGAF VATGDFNRDG 
VADLAATSRA TNEINVLLGN SDGTFAATVA YATGNFPISV AVSDVNHDGN DDLAVVNFGS
ASVSLLLGTG TGSFNAAVNL ATVGAPESIQ IADLNRDGHP DLVVGGLTAT NNVGVALGNG
SGGFASPLLS TVGGGGYGLT VADFNHDNRM DVAKTIYTDK AFTVGFGNGA GGFTGAVNYP
LSDLPYGIVP GDWNHDGNRD LAIVIRPPTN QVAVVFGSAN GSFGAPTYYP VGPTPEWLTA
SDLDNDGNQD LAVATISGNV WVLQGSSNGI FHDAGSFSGA FVPRAVAAAD FNRDGRNDLT
TSNEGAIVGV LLNRSDSQCG GIGFSATTTA IGAETLVGST TSDMNNDGDL DLVIANPSTN
SIIIRYDNGT GSFSSAISIS LGLQPSDVAV GDVNRDGRND LIVAAFLANQ ALVLLNNGAG
NFTPTSYPTP FNPVEITIAD FNNDGSPDFA SANYNANSIS VRMGNGSGAF GAATDYASDA
HPFKLTSIDL DRDGAIDLAW VNYVANTLSV RLNNGNGTFG STINTAVGVG PTSVNFGDWN
RDGIADAAVT NRSAHTLSIL RGTGTGAFSV TATMPSNLLP MDVLSHDFNL DGRLDLAVSY
TSESSVTIAR GNGDGTFAAL SAFGPRVANR GLSAGDFNHD GKIDLAGANY LSNTLAVLLN
SCPAAPPPTP TPTITPTPSA SSQRSAYLPL VSLRQISVLA TLNDQAIPIR PITVQGETYY
TTTIQLVTSL PPGGKFYFSA SPQSVQPIQV DDELVVRING QVQFSQIATT PMIVEIPRAT
LETWVGQSLE IAFRDVYGSL VGSSPVWLIW VP