Gene Pden_3856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_3856 
Symbol 
ID4582407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp988892 
End bp989887 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content65% 
IMG OID639771165 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_917618 
Protein GI119386563 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.215076 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAAT TTCTTGCTTC GACTACGCTT GCCTTCATGG CACTGGCCGT CCCCGCATTC 
GCCGCTGACG AGGAATGCGG CAGCATCACC GTGGCAGAGA TGAACTGGGC CTCGGCGGGG
CTGGCGGCGT GGGTCGACAA GATCATCCTC GAAGAGGGCT ATGGCTGCGA CGTGGCGCTG
GTCTCGGGCG ATACGATGCC GACTTTCGCG TCGATGAACG AAAAGGCCCA GCCGGACATG
GCCCCCGAGT TGTGGGTCAA CGCGGTCAAG GAGCCGCTGG ACCAGGCGGT CGCGGAAGGG
CGGATCGTGA TCGCCTCGCA GATCCTGTCC GATGGCGGGG TCGAGGGCAT CTGGGTGCCG
ACATGGCTGG CCGAAGAGCA CAACATCCAC ACGCTCAAGG ACGCGCTGGA ACATCCAGAA
CTGTTCCCCG GCGCCGAGGA CAGCAGCAAG GGCGCCTGGT TCGGCTGTCC CTCGGGCTGG
GCCTGCCAGG CGATCAACCG CAACCAGTTC GTCGGCTCGG GCGCTGCGGA CAAAGGGTTC
GAACTGGTCG ATTCCGGCTC GGCCGCCGCG CTGGACGGCT CGATCGCGCG GGCGGCGAAC
CGCAAGGAGG GCTGGCTGGG CTATTACTGG GCACCCACTG CCATCTTGGG CCAGTATGAC
ATGACCCGGC TGGAACTCGA GGCCGAATTC GACCGCGAGC GCTGGGACAA TTGCATGGTC
AAGCCCGACT GCGTCGATCC GCAGGTGACG GAATGGCCGG TCTCGGACGT CTATACCGCC
GTGACCAAGG AGTTCGCCGA CAAGGCCGGC GTGGCCATGG ATTACGTCAA GACCCGGGCC
TGGAGCAACG AGACCGTCAA CGCCATGCTG GCCTGGATGG TCAAGAACCA GGCCAGCAAC
GAGGATGCCG CCTATGAGTT CCTCGAACGG CACGAGGACA TCTGGACCGA ATGGGTTCCC
GCCGAAGTGG CCGACAAGGT CAGGGCCGCC CTTTAA
 
Protein sequence
MGKFLASTTL AFMALAVPAF AADEECGSIT VAEMNWASAG LAAWVDKIIL EEGYGCDVAL 
VSGDTMPTFA SMNEKAQPDM APELWVNAVK EPLDQAVAEG RIVIASQILS DGGVEGIWVP
TWLAEEHNIH TLKDALEHPE LFPGAEDSSK GAWFGCPSGW ACQAINRNQF VGSGAADKGF
ELVDSGSAAA LDGSIARAAN RKEGWLGYYW APTAILGQYD MTRLELEAEF DRERWDNCMV
KPDCVDPQVT EWPVSDVYTA VTKEFADKAG VAMDYVKTRA WSNETVNAML AWMVKNQASN
EDAAYEFLER HEDIWTEWVP AEVADKVRAA L