Gene Haur_4367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4367 
Symbol 
ID5736227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5578890 
End bp5580641 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content50% 
IMG OID641281528 
Productprotease domain-containing protein 
Protein accessionYP_001547127 
Protein GI159900880 
COG category[S] Function unknown 
COG ID[COG5276] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000129346 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTTTC AACGTTGGTC ACGTTATTTT AGCGTGGCAA TTTGCCTAAT CCTAATCGCT 
GCCTGTGCCA ATCAAAATGC GACAGCGATT CCTGCGACGG CAACAATCAA CAACAATGCT
GGCGAGCAAG CTCGCTCTGA TAAAGATGGC GTGAATCAAA GACCAACCAA AACGCCGCGA
CTCGCTGCCA CGGCGACTCC AACCATCACA GGGCCGCATT TCGAGCAAGT TGGCAGCTTG
CGACTTAAGC CACCAACCAG CGGTCGCCAT GCTGATCTCA CGTTGTATAA CGACTTGGTG
TTGCTTGGTA CGCAGCCTGG CTCATGCCCC GCCGAAAATC AAATTACATT AATTGATGTG
AGCGATCCGG CTAATCCAGA GTTGGCGGGC TATTCACCAA GCGTCAAAAA TGCCTCACTT
GAAGATATGG ATGTGGTGAG GATTGGCGAG CAAGATATTG CAGTTTTGGG GATTCAGCCC
TGCCGTGCAG CAACCAAACC TGGCATTCAA ATTGTGGATA TCACAGATCC AACTGAGCCG
CTCGAATTGG CTCGGTTTGA AACCAACCTT GGCGTACATG AACTCGATGT AACAATTACG
GCTAGTGGAC AAGCCTTGGC TTTGCTTGCT GCGCCCACCA ATAACGCTTT TGGTACAACG
CCCAAGCCCG AAGATCGTGG CGAATTGTGG ATTGTTGATA TTAGCGACCC TAGCCAACCA
ACCATGCTCA GCCGTTGGGG CATCGATCAA AAGCCCGATT GGCAAGCACT TGATTATGCA
GATCGGACTC GTGGCAATTT CCCAGGGATC TTTTTGCATA GCGTGCGGGC CAGCAGCAAC
AGCCAACGCG CCTATCTCTC GTACTGGGAT GGCGGCGTGA TTATCTTGGA TATTCAAGAC
CCCAATCAAC CAATCTACCT TGGCCAAACG CCCTATCCCG CGCTAGCCGA GGGCGATGCA
CACTCTGTCG TCGATTGGAA TGATGGGCAA ATGTTGGCCT TGAACAACGA AGATTTTAGC
AACGATCAGG CAAAAATTAG TCACCCAGCC TTGGCCGAGC CAGATTACGC CCACGAATTA
CCATTTGGCG GCAAACTTGA TGCCCCGCTG AGTGCCAAGG TTTTAGCGCT TGGGCAGGCT
TGCGATGCTG AAGCTGAATA TCCTGATTTC AAAGGTTTTT TTGTGTTGGC CGAGATGGCG
GGCTGTTCGA TTGAGCAAAA GCTGCAAATT GCCCAAAACG GCGAGGCCGC AGCATTATTG
ATTTATGCTA ATACACCATT TGAGGAGCTG CAATTTGGCG ATGATGTTGA TTTGATCGAT
GATTTTGATC TGCCGATGTT TACCATTAGC AGCACAACCG CCGCTGCCTT GCTCAGCCAA
CCTGAGGCTG AAGCAACGAT CGAAAGCTAT TTTGATGGTG GCGGCGCAAT TCAATTCTTC
GATCTGAGCA ATCCCAGCCA GCCAGTTGAA GTTGGGCGCT ACAATACGCC AAATTCGATT
AATGAGACGT TGCGATCCAA CCCAACTGTG CATAATTCTG AGGTGCAAGG CCAATATTTG
TATGCCTCGT GGTATCAAGA TGGCCTGCGC ATGCTCGATA TCAGCGATCC CAGCCAGCCC
AAATCAGTGG CAAGCTGGCC GCTGAACAAT TCGCCAAAAG TGGCCTTGTG GGGCGTACAA
GTGCGCGATC AATTTGTCTA TGTCAGCGAT TTCAGTTATG GCTTGTATAT TTTGGAGTTT
AAGGCCGAGT AA
 
Protein sequence
MLFQRWSRYF SVAICLILIA ACANQNATAI PATATINNNA GEQARSDKDG VNQRPTKTPR 
LAATATPTIT GPHFEQVGSL RLKPPTSGRH ADLTLYNDLV LLGTQPGSCP AENQITLIDV
SDPANPELAG YSPSVKNASL EDMDVVRIGE QDIAVLGIQP CRAATKPGIQ IVDITDPTEP
LELARFETNL GVHELDVTIT ASGQALALLA APTNNAFGTT PKPEDRGELW IVDISDPSQP
TMLSRWGIDQ KPDWQALDYA DRTRGNFPGI FLHSVRASSN SQRAYLSYWD GGVIILDIQD
PNQPIYLGQT PYPALAEGDA HSVVDWNDGQ MLALNNEDFS NDQAKISHPA LAEPDYAHEL
PFGGKLDAPL SAKVLALGQA CDAEAEYPDF KGFFVLAEMA GCSIEQKLQI AQNGEAAALL
IYANTPFEEL QFGDDVDLID DFDLPMFTIS STTAAALLSQ PEAEATIESY FDGGGAIQFF
DLSNPSQPVE VGRYNTPNSI NETLRSNPTV HNSEVQGQYL YASWYQDGLR MLDISDPSQP
KSVASWPLNN SPKVALWGVQ VRDQFVYVSD FSYGLYILEF KAE