Gene Haur_4038 details


Gene Information

Locus tag: Haur_4038
Symbol:
ID: 5735900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5154098
End bp: 5155888
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 55%
IMG OID: 641281189
Product: hypothetical protein
Protein accession: YP_001546798
Protein GI: 159900551
COG category:
COG ID:
TIGRFAM ID:
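
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(5154098, 5155888)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths match the listed Gene Length and Protein Length fields.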


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGTTGT TGTTACGGCA ACGTGTGAGT TTAGCCTTGG TTGTTGTGGC GCTCGCAGCA 
TGTCAACAAG CCCCACAGCC CACTGCTCGC CCGTTACAAA ATGAGCTAAC TATTCTCGCC
TTAACTCCGA CAGGCACGGC TACTCCTACG GCAACGGTGA CATTAACGCC TGCACCTGCG
TCGCCAACAA CCGAGCCTTC GGCTACCCCA ACCCGCACAG TTGGCCCTTC GCCTACCAAA
GGCGCTTCTC CAACCGCTGG CCCTTCACCA GTCGGTACAC CTCGGGCCAC TCCCGCGAGT
ACCCCAACTG CCAATCAAAG CAGTAGTTTT TGTACGCAGC CATTTGGGGC GGTAACCGAC
GAGCGCTTCA GCGCCCGCTT AAATAACGCT GGGCTTGATC GCACGCCCAA TGGCGATCGC
TTATTCTTAG AATTAACCTC CAGCAGCGGC CCCGTCAACG GTGTCGTGCG CTGTGTGCCT
CCAGCAGCGG CCCAATTGCT CGCAGGCGAT AGCGCAATTG CCAGCGTGAT TCAAATCGAT
CTGCCGCTAT GGCGACACGA TGATCTCTGG CGTTCATCGT CGGTCACGCT CACCAAAGTG
CTCAAACTTG ATACGCTGCA ACATGTACGC TCAGTCGTTT CGCAATCTAG CAGCGATTCA
GCTGGGGTAT TAATTGAAAT TGGGCTGGAT CAGGCTTTGC CATTTACGGT GCAGCTCGAT
GGCGGACGCT TAAATGTGGT GATTGCTGAT AGCGCTACCG CCACGCTGGG CGATGATCCA
CTAGCCAAGA GCAATGGTTC ACCCAGCGCA CCCAAGCAAC CAGTTGTGTT TGCCAGCAAA
GGCGATTTGT ATCGCTACGA AAGCTCGCGA GTCGTGCCAA TTACCACAAC CTTGGCAATT
GAAAGTGCAG TGGCAATCAG CCCTGATCGT ACCCAAATTG CCTTCTGTCG CGCCAACCCC
GATGGCTTGC CAACCCAAGG CGCACTCTGG ACCAGCACAA TTGATGGCGA TAATGAAACC
TTGGTCGCTG ATGTTGGTGG TTGTGCCGAG CCTGCTTGGT CGCTTGATGG CGGGATCATT
TGGTTTACTG CCCCTTGGAG CGATGCGGCC CCTGATAGCT ATCGACTCTG GCAGGTGAAA
GCCAATGGCG GCGATGCGAG CGCGGTCTCG CCGCTCGACG AGTGGAGCCG CCGTATGCCG
CATGCCTTGC CCGATGGCTC AGTACTGACG GTTGGCCATA CCGATGGCGG TCAAGGTGGC
TTGTTGATCA GTAATCCGTT GAGTGGCACT GATGGATTGC TTGGCCAAGC GAGCTTGGGC
AATTATCGCA GCGTTGGTCA AGCCCAAGTT TCGGCTGATG GCACCCGGAT TGCCGTCGAA
GCGCTCCGAG CTGATGGTGG CGCGGATCTC TTGGTGCTCG ACCAAACTGG GAAGCAACTC
GATGCAATTA CCGACCAATG GTGGGTACGC CCGCTCTCGT GGAGCAGCGA TAACAAACTC
TACTATTTAA ATGTGGCTTG TCGTAGCGGC CAAGTATTGA ATTATAGCCT GCACAGTCGC
CAAGGCAGCA ACGATAGTCA AATTATCAAA GGTGCAACCC TTGGCGATTT AGGTTCAGTC
GCTGTGGTTG ATGATGCGCT GTTGTATGTT CGGGCTTTAC AATCGCCCGA CAACGAACGT
GGTGCAGAGC CTATGATTAG CGGCCCTAGC GAGTTGTGGT TGTATGATCT TTCGAATACA
GCTCGTACCC GCCTGATTGC CGCCGATGAT GGAATTACCA GCGTCAAGTA A
 
Protein sequence
MKLLLRQRVS LALVVVALAA CQQAPQPTAR PLQNELTILA LTPTGTATPT ATVTLTPAPA 
SPTTEPSATP TRTVGPSPTK GASPTAGPSP VGTPRATPAS TPTANQSSSF CTQPFGAVTD
ERFSARLNNA GLDRTPNGDR LFLELTSSSG PVNGVVRCVP PAAAQLLAGD SAIASVIQID
LPLWRHDDLW RSSSVTLTKV LKLDTLQHVR SVVSQSSSDS AGVLIEIGLD QALPFTVQLD
GGRLNVVIAD SATATLGDDP LAKSNGSPSA PKQPVVFASK GDLYRYESSR VVPITTTLAI
ESAVAISPDR TQIAFCRANP DGLPTQGALW TSTIDGDNET LVADVGGCAE PAWSLDGGII
WFTAPWSDAA PDSYRLWQVK ANGGDASAVS PLDEWSRRMP HALPDGSVLT VGHTDGGQGG
LLISNPLSGT DGLLGQASLG NYRSVGQAQV SADGTRIAVE ALRADGGADL LVLDQTGKQL
DAITDQWWVR PLSWSSDNKL YYLNVACRSG QVLNYSLHSR QGSNDSQIIK GATLGDLGSV
AVVDDALLYV RALQSPDNER GAEPMISGPS ELWLYDLSNT ARTRLIAADD GITSVK
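
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the grouped text could be normalized and how a GC percentage could be recomputed from it. The helper names are hypothetical, and the GC formula used here (G plus C over total length) is an assumption about how the record's GC content field was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as shown in this record.
first_line = "TTGAAGTTGT TGTTACGGCA ACGTGTGAGT TTAGCCTTGG TTGTTGTGGC GCTCGCAGCA"
seq = clean_sequence(first_line)
print(len(seq))  # six blocks of ten bases
```

To check the whole gene, pass the full gene sequence text to `clean_sequence` and then to `gc_percent`. The cleaned length should equal the listed gene length if the sequence was copied completely.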