Gene Oter_4229 details


Gene Information

Locus tag: Oter_4229
Symbol:
ID: 6207744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Opitutus terrae PB90-1
Kingdom: Bacteria
Replicon accession: NC_010571
Strand:
Start bp: 5454785
End bp: 5456065
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 64%
IMG OID: 641693897
Product: hypothetical protein
Protein accession: YP_001821102
Protein GI: 182416036
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03407] urea ABC transporter, urea binding protein
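
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 5454785
END_BP = 5456065


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1281, matching "Gene Length"
print(protein_length(length_bp))  # 426, matching "Protein Length"
```

Under these assumptions, 1281 bp corresponds to 427 codons, of which 426 are translated and one is the stop codon.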


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.527328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.405756
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAAA AACTCGTTTC CCTACTCGGC GCCGGGGTGT TTGCCCTGAG CGCTTTGGCC 
GGCTCCCTCG CCGCGGCTGA GACCGTCAAG GTCGGCGTGC TGCATTCGCT CAGCGGCACC
ATGGCGATCA GCGAAACCTC GCTGCGCGAT GTGCTGCTGT TCGCCTTCGA CGAGATCAAC
GCGCAAGGCG GCGTGCTGGG CCGGCAGATC GAGCCGGTCG TCGTCGACGG CGCATCCAAC
TGGCCGCTGT TCGCCGAAAA GGCGAAGCAG CTGCTGGAGC AGGACCAGGT CGCCGTCGTG
TTCGGCTGCT GGACGTCCGT AAGCCGCAAG TCCGTGCTGC CGGTGTTCGA GAAAAACAAC
GGGCTGCTGT TCTACCCCGT CCAATACGAG GGCGAGGAGG AAAGCCAGAA CGTCGCCTAC
ACCGCCGAAG CGGTGAACCA GCAGGCGACG CCGGCGGTCG ACTACTATCT CGCCGAGGGC
AAGACGAAGT TCTACCTGCT CGGTTCGGAT TACGTTTATC CGCAGACCAC CAATCTCGTG
CTGCTCGAAT ATCTGCTGAG CAAGGGCGTG CCGCTCGAGA ACATCGGCGG TGGCTTCAAG
CGCGACGAGT CGGGCCGAAT CATCTCCGCC GGCAAATACA CGCCGTTCGG CCACACCGAC
TACCAGCAGA TCGTCGCCGA GATCAAACAG TTCGCCGCCT CCGGCGACGC CTGCGTCATC
AGCACGCTGA ACGGCGACAC GAACGTTCCG TTCTTCAAGG AGTACGCCGC GGCGGGGCTG
ACGTCCGACA CCTGCCCGGT GGTGTCGTTC TCGATTTCGG AGGATGAATT TCGCGGCCTG
CCGGCGAAGC AGCTCGTCGG TCAGCTGGGC TGCTGGACGT ATTTCCAGTC GCTCGATACG
CCGGCCAACA AGCAGTTCGT CGGCGCATTC CAAAAGTGGC TGGGCACGAC GAAGGTGCCG
GGCATCGTGA AGGAGGGCCG CGTGACCTGC TCGCCGATGG TGCTGAGCTA CGCCGGCGTG
CATCTCTGGA AGGCCTGCGT CGAAAAGGCG GGAACCTTCG ACGTCGACGC GGTGCGCGCC
GCGTGGAAGA GCGGCGTGTC GTTCGACGGG CCGGGCGGCC GCGTGACGAC GCAGCCCAAC
ATGCACCTCA CGAAAAACGT CTACATCGGC GAGACGCGCG CCGACGGCCA GTTCAAGATC
GTGAAGTCGT TCGACAACGT CGTCGGCGAG CCGTGGCTCA AGGGCAAGTT CAAGGCCGCG
GCCGTCGCGA CGGCCCAGTA G
 
Protein sequence
MTKKLVSLLG AGVFALSALA GSLAAAETVK VGVLHSLSGT MAISETSLRD VLLFAFDEIN 
AQGGVLGRQI EPVVVDGASN WPLFAEKAKQ LLEQDQVAVV FGCWTSVSRK SVLPVFEKNN
GLLFYPVQYE GEEESQNVAY TAEAVNQQAT PAVDYYLAEG KTKFYLLGSD YVYPQTTNLV
LLEYLLSKGV PLENIGGGFK RDESGRIISA GKYTPFGHTD YQQIVAEIKQ FAASGDACVI
STLNGDTNVP FFKEYAAAGL TSDTCPVVSF SISEDEFRGL PAKQLVGQLG CWTYFQSLDT
PANKQFVGAF QKWLGTTKVP GIVKEGRVTC SPMVLSYAGV HLWKACVEKA GTFDVDAVRA
AWKSGVSFDG PGGRVTTQPN MHLTKNVYIG ETRADGQFKI VKSFDNVVGE PWLKGKFKAA
AVATAQ
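
Both sequences above are printed in blocks of ten residues separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed fraction describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    # str.split() with no argument splits on any whitespace, including newlines.
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGACCAAAA AACTCGTTTC CCTACTCGGC GCCGGGGTGT TTGCCCTGAG CGCTTTGGCC"
)

seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of ten
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the "Gene Length" field, which is a quick way to confirm that no blocks were lost when copying.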