Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Meso_1950 |
Symbol | |
ID | 4182517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chelativorans sp. BNC1 |
Kingdom | Bacteria |
Replicon accession | NC_008254 |
Strand | + |
Start bp | 2090851 |
End bp | 2092371 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638067846 |
Product | extracellular solute-binding protein |
Protein accession | YP_674508 |
Protein GI | 110634300 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.412794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAACC TGTCTAAATC CGGCCTGGCC TGGACCTTGC TGGCTGCGCA GGCCGGACTT GCCTTCGCGG CAGCACCGGC CCTCACCGCC GAAGGGTCCA TCGTCCTGGC GATGCCGGCC GAGCCGACCT CGGTCGATGC CTGCGATGAC AGCACTAGGG CCAATGCCCG GGTGCTGCGC GGCAATGTCG TCGAGGCGCT GACGCGGCTC GACCCACAGT CGGGCGCGGT CGGCCCGCTG CTCGCCACCG AATGGTCTAG TCCGGATAAC AAGAGCTGGC TCTTCACCAT CCGGCCGGGC GTCACCTTCC ACGACGGCAC GCCGCTGGAT GCGGCCGCGG TGGCGTTCGG CATCAACCGC TCGATGAACC CGGACCTGAC GTGCCAGACC CTGTCGCTGT TCCCCACCAA GACCACCGCG ACCGTCGAAA GCGACATGGT GGTACGGATC ACCACCGAAG AGCCCGACCC GATCCTGCCG GCACGCATCG CCTATATCGA TCTGCCCTCC CCCAAGACTC CGGAAGCCGC CAAAAGTGAC ACCCCGATCG GCACCGGTCC CTACCGGTTT GCCGGCCGCG AGATCGGCCA GTCGATAACG CTATCGGCGT TTGACGGCTA TTGGGGCGAA GCCCCCGAGA TCGCCGAAGC CAACTATGTC TGGCGGGCCG AGGCCACCAT CCGCGCCAGC ATGATCAAGA CCGGCGAAGC CGATATCGCC TATGATATCC CCACCCATGA GGCCGAAGGC CAGGCCAATG CCCAGCAATA TTTGACCAAC GGCGTGTTCT ACCTGCGCCC GATGCTGCAG AAGCCGCCGC TGGACGATCT GCGCGTGCGC CAGGCCATCG CCTCCTCGAT AGACAAGGCC ACGCTGGCCG AAGTGCTGAT GGACAATTCG GGCACGCCGA CCGGGCAATT GGTCACACAG CTGATCAATG GCTACGTGCC CGATTATACC GGCATGCCCT ATGATCTCGA AAAGGCCAAG GCACTGTTCG CGGAAGCCAA GGCGGCCGGC GTCGCCGTCG ATACGCCGAT CACGCTGGTG GCCCGCACCG ACCTGTTCAG CGGCGCCGAG GAAGTCTCGC AGGCGATCCA GCAGATGATC CAGCAGGCCG GCTTCACCGT CACGCTGAAA TCGGTGGATA CCGTCGGCTG GAGCCCCTGG GCTCGCAAGC CGGACTCGCT TACCCAGCCG GTGAACCTGC TCACCTCGAG CCACAACAAC ATTTCGGGCG ACGGTTCGCT GACCTTCCCG AACTTCCTGG GCAGCGGCGG CCGGCTGAGC GTGGTCGACA ATGCCGAACT CGATGCCAAG CTGGCGGCCG CGGCCAAGGC CAGCGGCGAA GACCGGGCCG CGGCATATCG CGAGATCGCG CAATATGCCT ATGATCAGGA ACTGGTCATT CCGGTTGCGG CCCTGCAGGG GCTGCTGCTG ACCTCGGATC GCATTGCCTA CGAGGCGGAT GGCTTTACCG ACATCGAACT GCATCTGTCC GACGTCAAGC ACAAGCAGTA G
|
Protein sequence | MINLSKSGLA WTLLAAQAGL AFAAAPALTA EGSIVLAMPA EPTSVDACDD STRANARVLR GNVVEALTRL DPQSGAVGPL LATEWSSPDN KSWLFTIRPG VTFHDGTPLD AAAVAFGINR SMNPDLTCQT LSLFPTKTTA TVESDMVVRI TTEEPDPILP ARIAYIDLPS PKTPEAAKSD TPIGTGPYRF AGREIGQSIT LSAFDGYWGE APEIAEANYV WRAEATIRAS MIKTGEADIA YDIPTHEAEG QANAQQYLTN GVFYLRPMLQ KPPLDDLRVR QAIASSIDKA TLAEVLMDNS GTPTGQLVTQ LINGYVPDYT GMPYDLEKAK ALFAEAKAAG VAVDTPITLV ARTDLFSGAE EVSQAIQQMI QQAGFTVTLK SVDTVGWSPW ARKPDSLTQP VNLLTSSHNN ISGDGSLTFP NFLGSGGRLS VVDNAELDAK LAAAAKASGE DRAAAYREIA QYAYDQELVI PVAALQGLLL TSDRIAYEAD GFTDIELHLS DVKHKQ
|
| |