Gene Haur_4363 details

Gene Information

Locus tag: Haur_4363
Symbol:
ID: 5736223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5573524
End bp: 5575425
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 50%
IMG OID: 641281524
Product: cell wall anchor domain-containing protein
Protein accession: YP_001547123
Protein GI: 159900876
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
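
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(5573524, 5575425)
print(length)                              # 1902
print(expected_protein_length_aa(length))  # 633
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of this record.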


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTTGC TACGTGCGTT CGCGTTGCTG CTATTGCTGG CAGTTTTGGC TCCACTCGGC 
TTGCAGAGTG GGGTTGTAAG TGCCCAAAAT GAGCAACGCA CGGTCACTAT TTTGGAAACG
AGCGATATCC ACGGCAATTT GATGGCTTGG GATTATTACG CCAACAAGCC TGCCGAATGG
GGTATGACCA AGGTTGCAAG CTTGATCAAA CAAGAACGGG CGATCGATCC CAATCTCTTG
TTGGTCGATA ATGGTGATAC GATTCAAGGT ACGCCCTTGA CCTACTACTA CAACGTGATC
GACCAAAATG CCGCGCATCC AATGGCAGCG GTATTTAATG CGCTCAAATA TGATGTTTCA
TCGTTGGGCA ACCACGAATT TAATTATGGG ATGGATGTGC TGAATCGCTA TATCAGCCAA
GCTCAATACC CAGTTATGAG CGCCAATGTG CGCAAAAGCG ATGGTAGCGA AGCCTTCAAG
CCCTACATTA TTAAAGATGT GAATGGCGTG AAAGTAGGCT TTTTGGCCTT GACCACGCCA
ACTGTGCCAA CCTGGGAAAA ACCCGCCAAC ATCGCGGGCC TGCAATTTGC CGATCCGGTA
GAAGTTGCCA AGCAATATGT ACCGCAAATT CGGGCTGAAG GTGCGCATAT TGTGGTTTTG
CTGCAACACA CGGGCTGGGA AAAACAGCCT GCCGAAGCGA CCAAACCCGA AGCATGGCTA
ACCGACCCCA GCACTTGGCG CGATACTGGC TCGTTGCCAG GCGAAAATGT GTCGATCAAA
CTTGCCCAAG AAGTGCCTGG CGTTGACGTG ATTTTGACTG GTCACTCACA CTTGAGCGTG
CCCAAGGCGA TTATCAACAA TGTGTTATTG ATCGAGCCAT CGTATTGGGG CCGCGCTTTG
GGCAAAGTGA CGATTACGGT TGAGAAAAAT GGCGATAGCT GGAATGTGGT TAACAAAGAT
TCAACCAACA TTTCAGTCAC CAATGTTGCC GAAGATCAAG AGATTAAAGC ACTGGTACAA
CCATATCACG ACCAAACCTT GAGCTATATT AGCCAACCAG TTGGTACGGC TAGCGCCGAA
TTTGCTGGCG GCCCCAAGGC GCGTTATCGT GATAGCGCTT TGGCCGATTT GATCAACAAT
GTGCAAAAGC AAGCCGCTGC TGATGCTGGC TACCCCGTTG ATCTCTCGTT GGCAGCGATT
TTCACCGATG GCGGCATGAT TCCGGCGGGC CAAATTACCC TGCGCGATGC CTACAGCATC
TATATTTACG ATAATACGCT GTATGTGATG GAAATTAATG GTGATATTCT GCGCCGTGCT
TTGGAGCGTA ACGCCGAATA TTTCCGCCAG CTTGATCCCA ATGCCTTGCC CAGCGATCCC
AAAGCGGTAG TCAACGATAA TGCCCGCGAT TACAACTGGG ATTTATACAC CGATATCGAC
TATAGCTACG ATTTGACCAA GCCAGCCGGC CAGCGTGTGA CCAAATTGCA ATTGAATGGG
GTTGATATTA CACCTGAACA AACCCTGCGC ATCGCGATCA ACAATTACCG AGCTGGCGGC
GGCGGTGGCT TTGCCATGTT CCGTGAAGGC AAAATTGTCT ATCAATCGAC CAGCGAAATT
CGCGATTTGA TCGCTGAGTC AGTCAAAAAT GCTGGCACAA TTGATCCGAC GGTGGTGAAT
AAGGTTAATT TTACCCTTGT GCCAGATTTA TATGCCCACT ATTTTGGTGC TGCCAGCCAG
CCGACTGCTA CGCCAGTGCC AGCCCAACCA ACTGCCACTC CAGCGCCAGG TGTGCCAATT
ACCTTGCCTG ATACCAGTGG TAACCAACCA AGCTATGCCT GGGTTTGGGC GGCTGTCGCC
ATGGCCTTAC TCGCTTTAGG TTTGGTTGTG CGCCGCAATT AA
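
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be computed. The following is a minimal sketch using only the Python standard library; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, then upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGCGTTTGC TACGTGCGTT CGCGTTGCTG CTATTGCTGG CAGTTTTGGC TCCACTCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full multi-line sequence to `clean_sequence` in the same way would give a string whose length can be compared with the Gene Length field, and whose GC fraction can be compared with the GC content field.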
 
Protein sequence
MRLLRAFALL LLLAVLAPLG LQSGVVSAQN EQRTVTILET SDIHGNLMAW DYYANKPAEW 
GMTKVASLIK QERAIDPNLL LVDNGDTIQG TPLTYYYNVI DQNAAHPMAA VFNALKYDVS
SLGNHEFNYG MDVLNRYISQ AQYPVMSANV RKSDGSEAFK PYIIKDVNGV KVGFLALTTP
TVPTWEKPAN IAGLQFADPV EVAKQYVPQI RAEGAHIVVL LQHTGWEKQP AEATKPEAWL
TDPSTWRDTG SLPGENVSIK LAQEVPGVDV ILTGHSHLSV PKAIINNVLL IEPSYWGRAL
GKVTITVEKN GDSWNVVNKD STNISVTNVA EDQEIKALVQ PYHDQTLSYI SQPVGTASAE
FAGGPKARYR DSALADLINN VQKQAAADAG YPVDLSLAAI FTDGGMIPAG QITLRDAYSI
YIYDNTLYVM EINGDILRRA LERNAEYFRQ LDPNALPSDP KAVVNDNARD YNWDLYTDID
YSYDLTKPAG QRVTKLQLNG VDITPEQTLR IAINNYRAGG GGGFAMFREG KIVYQSTSEI
RDLIAESVKN AGTIDPTVVN KVNFTLVPDL YAHYFGAASQ PTATPVPAQP TATPAPGVPI
TLPDTSGNQP SYAWVWAAVA MALLALGLVV RRN
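
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it, under stated assumptions: the amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits, which this sketch does not model); the input contains only A, C, G, and T; and translation stops at the first stop codon. `CODON_TABLE` and `translate` are hypothetical names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence, as displayed.
print(translate("ATGCGTTTGC TACGTGCGTT CGCGTTGCTG"))  # MRLLRAFALL
print(translate("CGCCGCAATT AA"))                     # RRN
```

The two outputs match the first ten and last three residues of the protein sequence shown above.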