Gene Information |
Locus tag | EcSMS35_4254 |
Symbol | glnL |
ID | 6146202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4350034 |
End bp | 4351083 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619075 |
Product | nitrogen regulation protein NR(II) |
Protein accession | YP_001746199 |
Protein GI | 170681876 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | |
| |
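The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record above (EcSMS35_4254, glnL).
length = gene_length(4350034, 4351083)
print(length, protein_length(length))
```

With the values in this record, the result agrees with the listed Gene Length (1050 bp) and Protein Length (349 aa).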
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.388785 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAG GCACGCAGCC CGATGCTGGG CAGATCCTCA ACTCGCTGAT TAACAGTATT TTGTTAATTG ATGACAATCT GGCGATCCAT TACGCCAACC CCGCCGCGCA ACAACTGCTC GCCCAAAGCT CCCGCAAATT GTTTGGTACG CCGTTACCGG AACTGTTGAG CTACTTCTCA TTAAATATCG AGCTGATGCA AGAAAGTCTG GAGGCGGGGC AAGGCTTTAC CGATAACGAA GTGACGCTGG TCATCGACGG GCGCTCGCAT ATCCTTTCTG TGACGGCTCA GCGTATGCCG GACGGCATGA TCCTGCTGGA GATGGCACCG ATGGATAACC AGCGCCGCTT AAGTCAGGAA CAGCTACAGC ACGCCCAGCA GGTTGCTGCC CGTGATTTAG TGCGCGGCCT GGCACATGAG ATTAAAAATC CGCTTGGCGG TTTACGTGGC GCGGCGCAGC TGCTCAGCAA AGCATTACCT GACCCGTCAC TACTCGAATA TACCAAAGTG ATTATCGAAC AGGCGGACCG GCTGCGAAAT CTGGTCGACC GTCTGTTGGG GCCGCAGCTG CCCGGTACGC GTATTACCGA AAGTATTCAC AAAGTGGCTG AACGCGTGGT GACGCTGGTG TCGATGGAAC TGCCGAGCAA CGTGCGGTTG ATTCGGGATT ACGACCCCAG CCTGCCGGAA CTGGCGCACG ACCCGGATCA AATTGAACAG GTCTTGCTGA ATATTGTGCG CAATGCGCTA CAGGCGCTGG GGCCGGAGGG TGGTGAAATC ATTCTGCGTA CCCGCACCGC GTTTCAACTG ACCTTACACG GCGAGCGTTA TCGGCTGGCG GCGCGAATTG ATGTGGAAGA TAACGGGCCA GGTATTCCGC CTCATTTGCA GGATACGCTG TTTTACCCGA TGGTCAGCGG CCGCGAAGGT GGCACCGGGC TTGGCTTATC CATCGCCCGT AATTTGATTG ATCAGCATTC AGGCAAAATT GAATTTACCA GTTGGCCAGG TCATACCGAG TTCTCGGTTT ACCTGCCTAT CAGGAAATAA
|
Protein sequence | MATGTQPDAG QILNSLINSI LLIDDNLAIH YANPAAQQLL AQSSRKLFGT PLPELLSYFS LNIELMQESL EAGQGFTDNE VTLVIDGRSH ILSVTAQRMP DGMILLEMAP MDNQRRLSQE QLQHAQQVAA RDLVRGLAHE IKNPLGGLRG AAQLLSKALP DPSLLEYTKV IIEQADRLRN LVDRLLGPQL PGTRITESIH KVAERVVTLV SMELPSNVRL IRDYDPSLPE LAHDPDQIEQ VLLNIVRNAL QALGPEGGEI ILRTRTAFQL TLHGERYRLA ARIDVEDNGP GIPPHLQDTL FYPMVSGREG GTGLGLSIAR NLIDQHSGKI EFTSWPGHTE FSVYLPIRK
|
| |
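The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle. All function names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
head = clean_sequence("ATGGCAACAG GCACGCAGCC CGATGCTGGG")
print(translate(head))  # MATGTQPDAG
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` and `translate` gives values that can be compared against the GC content and Protein sequence fields in the record.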