Gene Dret_1881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1881 
Symbol 
ID8419724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2157941 
End bp2159623 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content66% 
IMG OID645038467 
ProductPeptidoglycan-binding lysin domain protein 
Protein accessionYP_003198743 
Protein GI258406001 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.210829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000286942 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCTGGT CGATCCCCCT TGTTTTCGCT CTCTTGGGAG CCATCCTCCT GTCAGCCGCT 
CCGGCCCGGA CAGCGCCGCG CCTGTTCTTC AACAAAAACG TCGACGCTCC GGCCGGCTAC
CAACTCTACA CCGTCCAGGC TGGGGACACA TTGTTCCGCA TCATGCGCGG TCGAGGCTGG
GACAACGCCG CCATCGAACA CGCCCTGCCC ACCATCGACC GCGCCAACCC CCATATCCCG
GATCTGGACA CCCTCCGGCC CGGACAGCGC ATCTTCCTGC CTCCCCCGCG AGGCGAAGCC
CCTTCGGCCC CAGGCCCCAC CCTGCCCGAC TCTGTCAGCC AGGTGCCGTA TACCGTCCAG
CCCGGGGACA CGGTCTTCAC CATTCTGCGC CGCCGGACAA ACCTGCCCGT GCGTGCCATC
CACCGCCGCT ATCTCGACTA TTTCCGCCAG GCCAATCCTG AACTCGACAA TATCGACCGC
ATCTATCCCG GTCAGCAACT GCGGCTCCCG GTGCCCAAGC CATCGCCCGT CGCCTCGGAC
GCAGCGACAA ACAGCACAGC CCCCGACCCA GCGTCGCCAT CTCCCCGTGC AAGCGCCAAC
GCCACCGAGC CTTCCACGCC GACTGCCGCC CCTACAGCTT CGCCCAAGAG TTCAAACAGC
ACGGTCACAA CACCCAAGAG AACACCCTCC GACGCCGACC TGGCCACCGC CGTGGGCAAG
GGGCCGCAAC CCGGAGCCGG TTCACAAACG ACGACCGGTG CAGACTCGGC GCCACCTCCG
CCCAATGCAG CCGGCGCGCC CCTGCCAGAA GGCCCTGACG CACCCCGCAC CACCACGGAC
GCCGCTTCGC CCACTCCGGC TCAAACTGCT GCCGCTAGCT CGTCCGAGTC TGCACCTTCC
CAGGCTGAGC CCAACGACGC TTTTTATGCC CTGGCCAAAC AGGCTGGCTA CCAGCCCGTT
CGCGGCGGGA CTCGATACTT CCCCACTGAC AAAGGCTGGC TGCAAATCGA CACCGAGCGC
ACTCCGCTTC TCAAGACCCC CTCCGGGGAG ACCCTCATCC TCGTCCCCGG AGAGGATACC
CAACGCTATG CAGACGCCGG TCTGACGCCC CTTTCTGTTC CCGCCAACTG GGAGACCGAA
GACGCCCTCA AAGCCCTGGA CCGGACCGAT CCTGAACTGG TCCAGCTCTG GCCGCGCAAC
AGACCGCTTA TCACTCACCG CCGCCACCTC GGCCTGGAAC TGCGCGGCAA ATGGATCTTT
GTCGACAAGG GACAGACCCC GTCGCGCATC TACGTCCTGA TCGATCCCCA GCGCCAGGTG
AGTTTCGAAA CCCGAGTCCT GGCCGGGCTG CTCCAACAAC GCAACATCAT TCTCAGCACA
TGGGACGCCC CTTCCCAGGA ACTCCTGCCG GTCCAGCCGC CGCCCCAAGA GCAGATCCTC
GTCCCCCACC TGCGCATGGC CGAATTGCGG CAGGAATTCG GCGGACGGGT CCCCGTGTCC
GGGATGCACG CCCGGCGCCA CCTCTTTCTG CCCCGATCGG GTGCCCCTAT CGGCATCACC
TTTCGCTGCC GCACCCTGCG CCGCGGCCAG GGCCCGGAGC TGATCTTTCC CCCCGGGACC
GATATCTACC TCCCGGCCCT GCTCCACCTC ACCGGCCGCC CGTGCTGGAT CTTGGGAGGG
TGA
 
Protein sequence
MRWSIPLVFA LLGAILLSAA PARTAPRLFF NKNVDAPAGY QLYTVQAGDT LFRIMRGRGW 
DNAAIEHALP TIDRANPHIP DLDTLRPGQR IFLPPPRGEA PSAPGPTLPD SVSQVPYTVQ
PGDTVFTILR RRTNLPVRAI HRRYLDYFRQ ANPELDNIDR IYPGQQLRLP VPKPSPVASD
AATNSTAPDP ASPSPRASAN ATEPSTPTAA PTASPKSSNS TVTTPKRTPS DADLATAVGK
GPQPGAGSQT TTGADSAPPP PNAAGAPLPE GPDAPRTTTD AASPTPAQTA AASSSESAPS
QAEPNDAFYA LAKQAGYQPV RGGTRYFPTD KGWLQIDTER TPLLKTPSGE TLILVPGEDT
QRYADAGLTP LSVPANWETE DALKALDRTD PELVQLWPRN RPLITHRRHL GLELRGKWIF
VDKGQTPSRI YVLIDPQRQV SFETRVLAGL LQQRNIILST WDAPSQELLP VQPPPQEQIL
VPHLRMAELR QEFGGRVPVS GMHARRHLFL PRSGAPIGIT FRCRTLRRGQ GPELIFPPGT
DIYLPALLHL TGRPCWILGG