Gene Haur_4490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4490 
Symbol 
ID5736341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5750015 
End bp5751271 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content53% 
IMG OID641281653 
Productdipeptidyl aminopeptidase/acylaminoacyl-peptidase-like 
Protein accessionYP_001547250 
Protein GI159901003 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATGGT TCAAGCGTTC TTGGCGCAAC ATTCTAGGAA TTATGGTTGG CCTTTTGGCA 
ATTGGTTTGG TCTGGCTGCT CACCACTGAT AACGTGACGA TCTGGCCGAT TCGCAATACG
CTGCGTTATC AATTCGACCA ATGGCGAACC GACAGCCAAA TGCCAAGCCA GCCAATCGCT
GATCGCAGCA TCGTTGGGTG TATTACCAAT GCGCAGCAGC AACCAATCGT TGGGGCAATT
GTGGCTGTCA GCGAACGCAA CGGCAGATTA CATCGAGCTA TCAGCGATCG TCAAGGCTGT
TATCGGCTTG GCAATGTGCC AGCTAACCAA TATCGCTTAT TGGTGACTGC GCCGAGTTAT
CGCGATGATT TGATCGATGT TGATGTGCAG CAAGCCCAAA CTGAGCAGCA TGCACAACTT
TTGCCGGCAA TTGCCCCAAG TTATGCCCCA GTCGAAAAAC TCGTGCTTGG CCCAAGCAAT
GTAGTTAGCC GAACTACGCC CTACCCTACC CAAGCATTGC GCCAGCAGGT GCAAGTTTGG
AGCGATAACG GCGAGCAGCA ATTGACCCTG CTCTATCGAC CAATTACCGC CACGCAACCG
TTGCCGTTGA TGTTGGCGGT CTACCCTGGC CCTGCCAACG AATGGGAGAG CGTGAGCATT
CCCTTGGCCG AGCGCGGTTA TAGCGTGTTG GCGGTTGGTC CAGCCTACAG CCTCGACCTC
GAAACTGATA TTGCCGATCT CAAGCGCTTG TTGGCGTTGG CGCGGGGTGG CTCGTTCGTG
GGAGTTGATG GCAGCCACAT TGCGATTATG GCAGGCAGTT ATAGCAGCCT GCACGTTTTG
CGCCTGTTGC AAGACGATGT AGGTTTTACG GGGGTGGTAT TGTTGGGGCC AATTAGCGAT
TTGTTTGCCA TGCGCGAGAG CTTTGTGGCC GGAACATTCA TGCCGCCGTT TGGGCTTGAT
CAAGCCCTGA TTGCCCTAGG TTATCCCGAC GAGGAGATTC AGCGCTATGC CAGCTATTCG
GCTCAATTAC ACCCTCGCGC TGATTTGCCG CCAATTTTGT TGATGCACAG TCGGAATGAT
GAAGTTGTGC CCGCCAGCCA ATCAGAATTT TTGGCTGAGC AATGGCGGGG CTTGGGCGTT
GAGGTTGAAA GTTATTTTTT CGATGGCATG TCGCATTATC TGCGGGCGGT CGAACCTTCA
CCAGAGCTTG ATGAGTTGTA TCGCATAACT TTAGATTTTC TGGCACGAGT TAATTAG
 
Protein sequence
MQWFKRSWRN ILGIMVGLLA IGLVWLLTTD NVTIWPIRNT LRYQFDQWRT DSQMPSQPIA 
DRSIVGCITN AQQQPIVGAI VAVSERNGRL HRAISDRQGC YRLGNVPANQ YRLLVTAPSY
RDDLIDVDVQ QAQTEQHAQL LPAIAPSYAP VEKLVLGPSN VVSRTTPYPT QALRQQVQVW
SDNGEQQLTL LYRPITATQP LPLMLAVYPG PANEWESVSI PLAERGYSVL AVGPAYSLDL
ETDIADLKRL LALARGGSFV GVDGSHIAIM AGSYSSLHVL RLLQDDVGFT GVVLLGPISD
LFAMRESFVA GTFMPPFGLD QALIALGYPD EEIQRYASYS AQLHPRADLP PILLMHSRND
EVVPASQSEF LAEQWRGLGV EVESYFFDGM SHYLRAVEPS PELDELYRIT LDFLARVN