Gene HS_0412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0412 
SymboldprA 
ID4239888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp437995 
End bp439101 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content38% 
IMG OID638103955 
ProductDNA processing chain A 
Protein accessionYP_718622 
Protein GI113460558 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.232612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAC GGAATGAGCT GTTACTGCGT TTGTTACAAG TGCCGAAGTT GGGAGCGTCC 
GGAATAACTC AACTTTTATC ACACATCGAT TTAAAAACAT TAGAGGATTA TGATGATGTT
GCGTTTCATT ATTTAGGCTG GAAACCTGAA CAGATAAATA GATGGTTGTA TCCGGATCTT
AGATATATTG AACCTGCATT ATTTTGGGCA AGCAAAAAGG GACATTATTT GATGAATTTC
TATCAAGAGA ATTACCCTTA TTTATTAAAG CAAACCATAG GTGCTCCCCC TTTGTTATTT
ATCAAAGGCA ATCCGGAAAT ACTTGCTCAA CAACAAGTCG CTATTGTAGG AAGCCGTCAT
TGCTCACATT ATGGCGAGTA TTGGGCAAAG TATTTTGCAA CGGAATTGTT CCTTGCCGGT
TTTGTTATTA CCAGTGGACT GGCTTTAGGG ATTGACGGTT TTTGTCATCA GGCTGTTGTG
GATATTCAAG GACAAACTAT CGGGGTATTA GGGGGAGGTT TGGAAGAGTT GTATCCCAAA
CAACATAAAA AATTAGCACA ACAAATGCTA GATTATGGCG GTGCGTTAGT GTCAGAATTT
TTACCTCATC AGCCTCCCAA ACCACAACAT TTTCCTCAAC GTAATCGCAT TATTAGCGGA
CTTTCTAAAG GTGTTTTAGT GGTTGAGGCA ACAGAAAAAA GCGGTTCACT CATTACTGCA
CGTTATGCTT TAGAACAAAA TAGAGAAGTT TTTGCACTTC CAGGACAAAT TCAAAATGAG
TATAGTCAAG GGTGTCATAG ATTAATAAAA GACGGTGCAT TGTTAGTAGA GAATGTTGCA
GATATTGTAG AAAATTTATC GCCTTTTATG CATTATGAAC GTCAACTAAC AGCAAAACAG
ATAGAAACGC AATTTCCGCC GGCTTATAAG TTACCTGCTT CACCGACTTA TCCTGAACTC
TATGCCCATA TTGGTTATAC GCCGGTAGGG CTTGATGAAT TATCAAATAA AAGCGGATTA
AGTGTAGATA CCTTATTGAT ACAGCTGTTA GAACTTGAAT TGCAAGATCT CGTTATTGCT
GAAAAAGGGT TATATCGACG GACTTAA
 
Protein sequence
MDKRNELLLR LLQVPKLGAS GITQLLSHID LKTLEDYDDV AFHYLGWKPE QINRWLYPDL 
RYIEPALFWA SKKGHYLMNF YQENYPYLLK QTIGAPPLLF IKGNPEILAQ QQVAIVGSRH
CSHYGEYWAK YFATELFLAG FVITSGLALG IDGFCHQAVV DIQGQTIGVL GGGLEELYPK
QHKKLAQQML DYGGALVSEF LPHQPPKPQH FPQRNRIISG LSKGVLVVEA TEKSGSLITA
RYALEQNREV FALPGQIQNE YSQGCHRLIK DGALLVENVA DIVENLSPFM HYERQLTAKQ
IETQFPPAYK LPASPTYPEL YAHIGYTPVG LDELSNKSGL SVDTLLIQLL ELELQDLVIA
EKGLYRRT