Gene Plim_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4047 
Symbol 
ID9140767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5199690 
End bp5200745 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF1559 
Protein accessionYP_003632057 
Protein GI296124279 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAACG AATCATCGCT GTCACGTCGC CGCCTTCATG CCGGCTTCAC TTTGATTGAA 
TTGCTGGTGG TCATCGCCAT CATTGCCATT CTGATTGCCC TGCTCCTTCC TGCAGTCCAG
CAGGCACGCG AAGCGGCCCG GCGAACGCAA TGTAAGAACA ATCTCAAACA ACTGGGGCTG
GCAGTTCACA ACTACGAATC GGCTCTCAAT GCGTTTCCAC CCTCGGCGAC GATCAATACG
TCGGTGACAA CCACGGGAAA CAATGGGTCG TGGTCGATTC ATGGGAGAAT TCTCCCTTAT
CTGGAACAGG GCAACCTGTA TTCCAAAGTG GACTTGAGCA TTGCGTGGGA TAATCAACTC
GCGATCAGCG GATTGAAGAT TCCCAGCTAC GCCTGCCCAA GTGACCCCAA GTCGGATACA
GTACGCGATC CCGGTGGCGG ACGGGCACTG CTTTATCCCA CCACCTATGG CTTTAACTAC
GGCACTTACT TTGTCTTCAA CCCCACCACT GGTCAGGGTG GGGATGGAGC GTTCTTTCCC
AATAGTAAAC TGAGTTTCAA CTCTTTTACC GATGGTACCA GCAATACGCT GCTCGCAGCG
GAAGTGAAGG CTTGGACTCC TTACAACCGC AATGCTTCGA GTTTTTCGTC GACGACTCCA
CCGACCAATG CCTCGGCTGC GGCAGCTCTA CTAAGCCAGG GAACTGATCC CAAATACAAC
CCTGCAACAG GGCACACCGA ATGGCCAGAT GGTCGAGTCC ATCATGCTGG TTTCACCACC
TGCATGAATC CCAATACGAA TCTTTCAACA ACGCATACCG ATGGAGTGAC CTATAACGAT
TGCGATTTCA ACTCGTGGCA GGAAGGTCGC AATGGAAGCA CAGGTTCACC AAGTTATGCA
GTCATCGTCT CACGCAGTTG GCACGAGGGG ATCGTGAATG TCTGTATGGT CGATGGCTCA
GTGCGTACGG TGAGTGAAAA CATCGACAAT GGGATCTGGC GGGCCTTAGG AACCCGCAAT
GGTGGCAGCA ACGAAACCAC CGTCGGCGAG TTCTAA
 
Protein sequence
MMNESSLSRR RLHAGFTLIE LLVVIAIIAI LIALLLPAVQ QAREAARRTQ CKNNLKQLGL 
AVHNYESALN AFPPSATINT SVTTTGNNGS WSIHGRILPY LEQGNLYSKV DLSIAWDNQL
AISGLKIPSY ACPSDPKSDT VRDPGGGRAL LYPTTYGFNY GTYFVFNPTT GQGGDGAFFP
NSKLSFNSFT DGTSNTLLAA EVKAWTPYNR NASSFSSTTP PTNASAAAAL LSQGTDPKYN
PATGHTEWPD GRVHHAGFTT CMNPNTNLST THTDGVTYND CDFNSWQEGR NGSTGSPSYA
VIVSRSWHEG IVNVCMVDGS VRTVSENIDN GIWRALGTRN GGSNETTVGE F