Gene Haur_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0073 
Symbol 
ID5731946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp95172 
End bp96338 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content48% 
IMG OID641277195 
Producthypothetical protein 
Protein accessionYP_001542853 
Protein GI159896606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTA AAGTGAAATT TTTCGCGATT TTCATTGGCA TCGTCGCGTT GTCATTTTTC 
TTATTTCGCT CCTGCGATCC TCAAGCGCGT AACCTGCGGC CTGGCTCGAT GTACAACCAA
GGTGGCGACG GAACTGCCGC TTTGCAGCTT TGGCTGAGCA AAATTGGCTA TCAAACTGAG
TCATTTGAAT ATCGCGATTT CGATGAGCTT GACCAAACAA TCGATACTTT GCTTGTAATC
AAGCCAAGCG ATACTAATAA TTGGCGCAAG GAAGAAATTG ATGCGGTTTT AGCGTGGGTT
GAAGATACCG GCGGCACGCT GATTGTCGCT GATGATCAAC AAAATGGCCT TTTGACCCGC
TTGGATCTGA CGGTCACCCG CATCGAAGCC TTGGAAATGG TCAGCACCAG TGATACCAGT
CATGCTTTGG TGAACCCGAT AGTAAACGGT TTGCAAGGCT ATCAAACAAT TAGCTATTTT
GAGCGAGTGA GACCCAATAG CCAAGTGATT GTTGGTAGCG AAGCACAGCC GACGACGCTT
GGCATTAGTC GTGGCCGTGG GATGATCTAT GCTTCAACCA ATATTTGGCT CTTTACCAAT
GCTGGTTTGT TTTATGAAAG CAATGCCAAA ATTATCCTTA ATATGGTTAA TCGGATGCCC
GCAGGCAGTG TAATTGCCTT TGATGAGGTA CATCATGGCC GTGCCTTACC ACCCAAAGCC
GCTCCTGTGC CAGCCCAACC CTATTCGCCT TTGGTTGCGG CGATGGTCTA TAGCGCAATG
GTTGTGGGCT TATGGGCCTT GCTTTCTGGT CGCCGTTTTG GTCAGATTGT GCCCAGCAGA
ATCGATTTGA TGCGACGGAA TAGCAGCGAA TATGTCCAAT CGATGGCCAA TTTGTTTCAG
CGCGGTCGTC AAGCCGAACA TATGCAAGCC CACTATAAAA CCTATCTCAA ACGCCGAGTT
GCTAAGCCCT ATGGGATTAA CCCCAAGCTT GATGACCAGA GCTTTTTAAG TGAAGTTCAG
CGGTATTCCG ATACAATCGA TCGTAATCAC TTAGCTCATT TGCTCAACCA TTTAAGCCAG
CCCAACCCCA GCGAGGCGAC GATCTTGGCC CTGGTCAACG ATATCGATCG CTTTATCAAC
CTATGGGAAC AACAGGGTCG GGCCTAA
 
Protein sequence
MNSKVKFFAI FIGIVALSFF LFRSCDPQAR NLRPGSMYNQ GGDGTAALQL WLSKIGYQTE 
SFEYRDFDEL DQTIDTLLVI KPSDTNNWRK EEIDAVLAWV EDTGGTLIVA DDQQNGLLTR
LDLTVTRIEA LEMVSTSDTS HALVNPIVNG LQGYQTISYF ERVRPNSQVI VGSEAQPTTL
GISRGRGMIY ASTNIWLFTN AGLFYESNAK IILNMVNRMP AGSVIAFDEV HHGRALPPKA
APVPAQPYSP LVAAMVYSAM VVGLWALLSG RRFGQIVPSR IDLMRRNSSE YVQSMANLFQ
RGRQAEHMQA HYKTYLKRRV AKPYGINPKL DDQSFLSEVQ RYSDTIDRNH LAHLLNHLSQ
PNPSEATILA LVNDIDRFIN LWEQQGRA