Gene Plim_3741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3741 
Symbol 
ID9140459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4811209 
End bp4812465 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content56% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003631752 
Protein GI296123974 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGAT TTCCTGCGGA TCGATCGGCA GGCGCTCCCC CTGGGACAAT GAACCCTCAG 
AACCGGTCGG CAACTGCTCC CCTTCTGACA AAGGGGACTC GAAAACCCGC GTGGAAGCAG
CAGCAGTGGT TGATACCCAG TCTGATCTGG GGTTCGTTGG CAGCCGTTGT GCTGGGTATG
CACTTTTACT TCCCCCCGAT TCCTGCTCCT CGAACTGCGA CGAGTTACAG TGCTTCTGCC
GATGGTTTTA AGGCGTTGTA CGAGATTCTC GAACAGGATG CTTTTGTCTA TCGCAACGAT
GCACCACTGG ATCGACTGAT GGAACTGGTT GACCCCGATG GCACACTCTT GTTGATTTTG
AACCCGCCGC GAATTCCGAA TGAAGCCGAG TGGAACAGCC TCTATTCGTG GGTCAACATG
GGTGGTCGCC TGGTCTATGC GCCTCCTCCG GGTGAAGTCG ACTCTTTGGG CCCATTCGAT
GGCGAAATCA CGCCCGAACA AGGCCCGGCT GACGACCGCA TTCCACCGCA GTTGAACCTC
CCCTTAGGAG GCCGGTTTCT GTGGTGGCCG GAAGGGGAAG TGACATCGAC TCCCGGCGGA
AAGGTGCTTG TTTCTCAGGA TGGCTCGCCT CAGGCTGTCA TGGTGAATGC GGGGCGAGGG
TCGGCTCTTT TTGTTGCCAG CCCGTGGATC TTTTCCAACC AGCTTCTGAC CTATGGCGAT
AACAGTGCTC TGGCTTTTGA ACTGATTCGC GAAGCTGCCG GGCCGGGGCA AAGCCTGGAC
GATGTCGTCA TCGCTTTTGA TGAATCTCTC AATACCCGGG CGACACCCCA GATGATGGGT
GTCTTGTTTC AGCCGCCACT GCGTTCGATT TCCGTGCAGA TTCTGCTGCT GTTCATGCTC
TATGGCTGGT GGAACAGTTG CCGGTTCGGG CCCACGGTTG TTCTCGAAGA AACGTCTCAG
CGGGAAATCG TCGAACACAC CAGTGCTCTG GGGCGAATTC TCTGGCGATC AGCGGATTGC
CAGTTCGTGC TATTTCAGTA TTTAAGGTAC TGGCTGACGG AATATCGACT GCAGGAAGCT
TCCGGTCGCA AACGCCGCCT GTCGAGTCGC TTACAGAATG ATGCTCAACA GGTCGATCAG
GCACTGGAAG CGATCCATCA GGCCGAGATC GCTGCGATGA CACCCCGGCT GGGCCATCGC
GAAGCGGCAC GCCATATTCG AGCACTTTCA TTGATTGGCC AGAGCCTGCA GCGCTAA
 
Protein sequence
MSRFPADRSA GAPPGTMNPQ NRSATAPLLT KGTRKPAWKQ QQWLIPSLIW GSLAAVVLGM 
HFYFPPIPAP RTATSYSASA DGFKALYEIL EQDAFVYRND APLDRLMELV DPDGTLLLIL
NPPRIPNEAE WNSLYSWVNM GGRLVYAPPP GEVDSLGPFD GEITPEQGPA DDRIPPQLNL
PLGGRFLWWP EGEVTSTPGG KVLVSQDGSP QAVMVNAGRG SALFVASPWI FSNQLLTYGD
NSALAFELIR EAAGPGQSLD DVVIAFDESL NTRATPQMMG VLFQPPLRSI SVQILLLFML
YGWWNSCRFG PTVVLEETSQ REIVEHTSAL GRILWRSADC QFVLFQYLRY WLTEYRLQEA
SGRKRRLSSR LQNDAQQVDQ ALEAIHQAEI AAMTPRLGHR EAARHIRALS LIGQSLQR