Gene Haur_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3921 
Symbol 
ID5735782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4913860 
End bp4914942 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content48% 
IMG OID641281072 
Productputative ribonuclease BN 
Protein accessionYP_001546683 
Protein GI159900436 
COG category[S] Function unknown 
COG ID[COG1295] Predicted membrane protein 
TIGRFAM ID[TIGR00765] YihY family protein (not ribonuclease BN) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00855127 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCAA GTATTATTAA TTTATTTAAA CAAACATTTA AAGAGTGGGG CGATGACAAA 
GTGCCACGGC TTGGGGCTGC CTTGGCCTAT TACACGGTAT TTTCGCTTGC GCCACTGTTA
ATTATTGCAA TTAGCATTGC TGGCTTGGTG TTTGATCAAG AGGCCGCTCG CGGTGAGGTT
ACCCGTCAAT TAGCCACGTT AATTAATGAC GATGCGGCCC AAGCGATCAA CGAAATTATT
CAGCAATCGA GTAACCAGCG CTCAGGCATT ATCGGCACAT TAATTGGTGT GGCAACCTTG
CTCTTTGGTG CTTCCGGCGT TTTTGGTCAG CTCAAAGATG CCATGAACAC GATTTGGGGG
GTGCAACCAA AGCCTAGGCG CGGGATTTGG GGCATCGTCC AGGAGCGCTT TTTCTCGTTT
ACGATGGTGC TGGGCGTGGG CTTTTTGCTC TTAGTTTCGC TGATTATCAG CACATTGCTT
GAGGCTGGCA AAAATTGGCT GTTTGGCGCA GAGATTGGCA TTGTGTTTCA GATTATCAAT
TTAATCGTTT CATTTGGCAT TATCACCGTC GTTTTTGCCC TGTTGTTCAA ATTTTTGCCT
GATGTTAAGA TGGCTTGGCG TGATGTTTGG ATTGGTGCAG CACTCACGGC ATTGCTCTTC
ACCATCGGTA AATTTGCGAT TGGTCAATAT CTGAGTACCA GCAGCACCGC CTCGACGTTT
GGCGCGGCTG GCTCATTGAT TATTGTGCTG TTGTGGGTCT ATTATTCGAG CCAAATTCTC
TTTTTTGGAG CTGAATTGAC CCAAGTCTAT GCCAATATGT ATGGCTCACA CGTGCAACCC
GATGATGATG CTGTAGCCGT GACTGCCGCT GCTCGGGCTG AACAAGGCCT GAGTAATCCT
CATTCGCCGC GTGATCAGCG CACGCCGCGA CCGAAATTGG CCGCCAGCAA ACCACAAATT
ATTGTCACTG AACGCTTTAA ATCATTGGAA AAACAACGCT ATCTAGCAGC AGTTTTGGGC
TTTCTGCTGG CAGTGGTCGC AGGGGCATTC AAGAGTTTAC GCAACGATAA ATCGCCATCT
TAA
 
Protein sequence
MQPSIINLFK QTFKEWGDDK VPRLGAALAY YTVFSLAPLL IIAISIAGLV FDQEAARGEV 
TRQLATLIND DAAQAINEII QQSSNQRSGI IGTLIGVATL LFGASGVFGQ LKDAMNTIWG
VQPKPRRGIW GIVQERFFSF TMVLGVGFLL LVSLIISTLL EAGKNWLFGA EIGIVFQIIN
LIVSFGIITV VFALLFKFLP DVKMAWRDVW IGAALTALLF TIGKFAIGQY LSTSSTASTF
GAAGSLIIVL LWVYYSSQIL FFGAELTQVY ANMYGSHVQP DDDAVAVTAA ARAEQGLSNP
HSPRDQRTPR PKLAASKPQI IVTERFKSLE KQRYLAAVLG FLLAVVAGAF KSLRNDKSPS