Gene Pnuc_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnuc_1996 
Symbol 
ID5052192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 
KingdomBacteria 
Replicon accessionNC_009379 
Strand
Start bp2067258 
End bp2068445 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content46% 
IMG OID640472170 
Producthypothetical protein 
Protein accessionYP_001156771 
Protein GI145590174 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTA CCTTGACCAG CCTAGAAACG GAGCATAGCG AGCTCCTCAG GGCCAAAATA 
AGCTCTCAGA TCAACTCAGA GGGCGGTTGG ATACCCTTTT CCCGTTTTAT GGAAATGGCC
CTATATGAGC CTGCAATGGG CTATTACAGC GCCGGGGCCC ATAAGCTGGG ATCAGGGGGT
GATTTCACAA CGGCTCCCGA GCTAAGCCCC TTATTTGGGG CTGCGATTTG TTCAACCCTA
CTACCAGTAC TGGAAGGCTT CAAAGCACAG GGTCTACCTA CGCAAATTCT GGAGTTTGGC
GCAGGGACGG GCAAATTGGC TAGCTCGATT CTGACTCGCC TTCATGACCT CGATTTCAGC
CTGGATCGAT ACGACATCCT AGAAATCTCT CCTGACTTGG CGCAGCGTCA AAAAGAACAC
ATTAGCAAAA CAGTTGATCA ACTGAACTCA TCTACTCAGT GTGACTGGTT GAAAGCATTG
CCGCAGAATT TCAAGGGAGT GATTCTTGCA AACGAAGTCA TTGATGCGAT TCCTTGTGAC
GCAATTGTTT ATCAAAATGG ATTTTGGTAT TGGTATGGAG TGGCACTAAA TGATGGCAAG
CTGATTTGGA AGACGGGATC TCCGGTAGAG CAGGATTTAC TACCTGAGAG TTTATTGAGC
GCTAGCTTTT CAGAAAGTTA CGTCACCGAA CTTCATGCGC CAGCGAATGA CTGGATGCGT
CAAGTGGCGA GAAATTTAGA CTCTGGCTTG TTTTTAACTT TTGATTATGG CTTTCCTGAA
GGCGAGTATT ACCACCCGCA AAGACTGGAA GGCACTCTCA TGGCACATCA CCGTCATCAC
GCAATTCAAG ATCCGTTTTA TCTACCAGGC TTATGTGATA TCACCACTCA CGTGGAGTGG
TCACAAATTG CTCGTAGCGC ACTGACTGAA AATGCGGATG ATGTCTATCT CACCAATCAA
GCCGCTTATC TACTGGATGC CGGCATTGGT GATATTGCCT TAGAGATTGG TGACCCAAGC
AATCCAGAGA CTTTTTTGCC GATTTCGAAT TCATTACAAA AATTATTGTC GGAAGCAGAG
ATGGGTGAAT TATTTAAAGC TTTTGCCTTC TCAAAAAATC TAGACTCTCT ATTGCCAGGC
TACACACTGG AAGATCTTCC AGGTCTGCGT GGGAGAAATC GCCTTTAA
 
Protein sequence
MDITLTSLET EHSELLRAKI SSQINSEGGW IPFSRFMEMA LYEPAMGYYS AGAHKLGSGG 
DFTTAPELSP LFGAAICSTL LPVLEGFKAQ GLPTQILEFG AGTGKLASSI LTRLHDLDFS
LDRYDILEIS PDLAQRQKEH ISKTVDQLNS STQCDWLKAL PQNFKGVILA NEVIDAIPCD
AIVYQNGFWY WYGVALNDGK LIWKTGSPVE QDLLPESLLS ASFSESYVTE LHAPANDWMR
QVARNLDSGL FLTFDYGFPE GEYYHPQRLE GTLMAHHRHH AIQDPFYLPG LCDITTHVEW
SQIARSALTE NADDVYLTNQ AAYLLDAGIG DIALEIGDPS NPETFLPISN SLQKLLSEAE
MGELFKAFAF SKNLDSLLPG YTLEDLPGLR GRNRL