Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3655 |
Symbol | |
ID | 4075624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 709241 |
End bp | 710176 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638005175 |
Product | NLPA lipoprotein |
Protein accession | YP_611884 |
Protein GI | 99078626 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.59768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGC TCATCGCAGC GCTGGCCATG CTGGCCGCCA CCCCGGCCAT GGCGCAGGAT AGAATGACAC TGCTCTTGGA TTGGTTCGTC AATCCCGACC ATGGCCCGAT CATTATTGCC GAGGAAAACG GCTATTTCGC CGAACAGGGC CTCGAGGTCG AGGTTGTCGC TCCCGCAGAC CCATCGGCCC CCCCAAGGCT CGTGGCTGCG GGTCAGGCGG ATCTGGCCGT TTCTTATCAA CCGCAACTTC ACCTTCAGAT CCACGAAGGC CTGCCTCTGA AACGCGTCGG CACCTTGGTT GCAACACCTC TGAACTGTCT TCTGGTGCTA AAGGATGGCC CAATTCAGGA TCTTTCAGAC CTCGCCGGCA AGAAAATCGG CTTCTCCGTG GCCGGTGTCG AAGAGGCCGT GCTGCAAGCG ATGCTGGGCC AGCATGGTGT CTCCTCTGAC GACATTGAGA TGATTAATGT GAACTTCTCT TTGTCGCCCT CTCTGATGTC GGGGCAGGTT GACGCAGTCA TCGGTGCTTT TCGGAACTTT GAGCTGAACC AGATGGAGAT CGAAGGCGTC GAGGGGCGGT GTTTCTACGT CGAAGAGGAA GGTGTGCCTT CTTATGATGA GCTGATTTAT GTCGCCAATC CAGAGCGGAT GGACACGGAC AAAATCGCGC GTTTCCTCGC AGCAACGGAA AAGGCGACCC AGTACATTGT GAACAACCCC GAGAAAAGCT GGGAAATCTT TGCGGCCACC TCGACCGAGC TGCAGGACGA ATTGAATGCA CGCGCATGGG TCGACACGCT GCCTCGCTTT GCGCTGCGCC CGGCCGGGTT TGATGCGGGC CGCTACACAC GGTTTGAGGC CTTTCTCAAA GACAGCGGCA TGATTGACAG CTTGAACCCG GTGGACGCCA TCGCAATCGA CGTGACTGCC CCATGA
|
Protein sequence | MKTLIAALAM LAATPAMAQD RMTLLLDWFV NPDHGPIIIA EENGYFAEQG LEVEVVAPAD PSAPPRLVAA GQADLAVSYQ PQLHLQIHEG LPLKRVGTLV ATPLNCLLVL KDGPIQDLSD LAGKKIGFSV AGVEEAVLQA MLGQHGVSSD DIEMINVNFS LSPSLMSGQV DAVIGAFRNF ELNQMEIEGV EGRCFYVEEE GVPSYDELIY VANPERMDTD KIARFLAATE KATQYIVNNP EKSWEIFAAT STELQDELNA RAWVDTLPRF ALRPAGFDAG RYTRFEAFLK DSGMIDSLNP VDAIAIDVTA P
|
| |