Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3608 |
Symbol | |
ID | 4075035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 658318 |
End bp | 659745 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005127 |
Product | amino acid carrier protein |
Protein accession | YP_611837 |
Protein GI | 99078579 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1115] Na+/alanine symporter |
TIGRFAM ID | [TIGR00835] amino acid carrier protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.221851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.096261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCGC TTGAACAATT CTTCGGTGTG ATCGGAGACC TGACCTGGGG ATGGTCCCTG GTCCCGATAC TGGTCATTTT CGGCAGTTTC ATCACCGTCG CCACCGGGTT TGTCCAGATT GAGTTCTTTG GCCGAATGTT CCGCGTCTTG ACCCGCGGTC GTACCGATAC CGGTCAGATT TCCGCGCGCG AGGCGCTCTT GGTTTCTGTT GGGGGGCGCG TGGGCGGTGG CAATATCGCA GGTGTTGCGG TCGCCATTAC ACTGGGTGGC CCTGGGGCAG TGTTCTGGAT GTGGGCGATT GCGCTGGTCG GGATGGCCAC GAGCCTGGTG GAATGTTCTC TTGCGCAACT CTTCAAGCGC GCAGACCCCC TCACGGGACA ATATCGCGGC GGGCCTGCGC GAGCCATCCT GCACGGTTTG GGCGAAAACT ATCGCTGGCT TGCTGTGATC TATGCGATTT GCCTGATTGC GGCCTTCGCG CTTGGGTTCA ACGCGTTTCA GGGCAACACG GTGGCTGGTG CGGCAGAGGC CTCCCTCGGG ATCGATCGCC TGTGGACGGG TATTTTCCTG GCAATTGTCA CCGGTTTCAT CGTTTTTGGT GGTATTCGAC GTATCGCCAA AGCGTCCGAG GTTGTGGTCC CGGTCATGGC GGTTGGGTAT CTTCTGATGG CGCTGATCGT GATCCTCACC AACATCGGCG AGATCCCTCA GGTTCTACGC AGCATTCTTG CCAATGCCTT TGGCTTTGAA GAGGCGGTCA GCGGGGGCAT GGGCGCTGCA CTGGCACAGG GACTGCGCCG CGGGCTCTTT TCAAACGAGG CGGGTCTCGG CTCTGCTCCC AATGTGGCCG CAACAGCTGA GGTGAACCAC CCGATCAGCC AGGGGATCAC GCAGGCGTTT TCTGTGTTCA TTGACACGAT CATCATCTGC AGCTGTACCG CTTTTGTCAT CCTGCTGAGC GATGTTTACG TCCCGGGAGC CACCGATATT GACGGCGTCG CCCTGACCCA GAGCGCAATG GTGTCGCATC TGGGCGCGTG GTCGCAGTAT TTCATGACCT TTGCGATCCT GCTCTTTGCG TTCTCCTCGG TGATCTACAA CTACTACCTC GGAGAGAATG CAATCTCCAT CCTCACCAAG ACCCCTCATG CGATGACGCT GCTGAAGCTG GTGGTCGTGG GGATCGTCTT TCTGGGTGCC GTGGCTCCAA ACGCAACCTC GGTCTTCTTC TTCTCGGATC CGTTGATGGG TGTGTTGGCG CTGGTGAACC TCTTGGCTCT GATGATGCTT TTTCCCATTG CGCTGCGGGT TTTGCAGGAC TTTCGAAAGC AGCTGAAAGC GGGGGTCGAG CGGCCTGTGC TGCGTATGGA AGACTTCCCG GATCTCGATC TGGATCCAAA CGCCTGGCCC AATCGGGGCC AATCCTGA
|
Protein sequence | MQALEQFFGV IGDLTWGWSL VPILVIFGSF ITVATGFVQI EFFGRMFRVL TRGRTDTGQI SAREALLVSV GGRVGGGNIA GVAVAITLGG PGAVFWMWAI ALVGMATSLV ECSLAQLFKR ADPLTGQYRG GPARAILHGL GENYRWLAVI YAICLIAAFA LGFNAFQGNT VAGAAEASLG IDRLWTGIFL AIVTGFIVFG GIRRIAKASE VVVPVMAVGY LLMALIVILT NIGEIPQVLR SILANAFGFE EAVSGGMGAA LAQGLRRGLF SNEAGLGSAP NVAATAEVNH PISQGITQAF SVFIDTIIIC SCTAFVILLS DVYVPGATDI DGVALTQSAM VSHLGAWSQY FMTFAILLFA FSSVIYNYYL GENAISILTK TPHAMTLLKL VVVGIVFLGA VAPNATSVFF FSDPLMGVLA LVNLLALMML FPIALRVLQD FRKQLKAGVE RPVLRMEDFP DLDLDPNAWP NRGQS
|
| |