Gene Information |
Locus tag | Francci3_4122 |
Symbol | |
ID | 3907087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4924560 |
End bp | 4925798 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881450 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_483199 |
Protein GI | 86742799 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
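The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the listed values are consistent with that) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4924560, 4925798)
print(length, protein_length(length))  # prints "1239 412"
```

Both results match the Gene Length (1239 bp) and Protein Length (412 aa) rows.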
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGGTGT TATTCCCACG GTGCGCGGGC CTTGACGTGC ACCGGGACAC GGTCGCCGCG GCGGTGCGCA TCCAAACCGG TTCGGGTAAA GCCGTCACCG AGGTCCGGAC GTTCACCACG ACCGGGGGCT CGCTCGGACT ACTCGCGGAC TGGCTCACCG AGTGCCGGGT GACGATCGTC GGGATGGAAT CGACAGGCGT CTACTGGAAG CCGGTGTTCC ACCTGCTGGA AGACCGGTTC GAGTGCTGGC TCCTCAACGC CACCCACGTC CGCAACGTAC CAGGCCGAAA AACAGACGTC GCGGACGCGG CGTGGATCTC GGACCTCGTC GCGCACGGCC TGGTACGCGC CTCGTTCGTG CCGCCGAAAC CCCAGCGGGA CCTGCGTGAC CTGACCCGGG CCCGGCGGAT CGTGGTCGAG GAGAAGACCC GGGAGATCCA GCGGCTGGAG AAACTGATGC AGGACGCCGG CGTGAAACTC ACCAGCGTCG CCTCCAAGCT ACTCGGGGTC TCGGGCCGTG CGATCCTGGA GAAGATGATC GAGGGAGAGC AGTCCCTGGA ATATCTCGCT GATCAGGCCC GTGGCCGACT CCGCAGCAAG ATCCCACAGT TGCAGGAGGC ACGCGCGGGA ACGTTCCGCT CCGGGCATCA CGGGTTCCTC GCCGCGCAGC TCCTGGCCCG GATCGACCTG TGTGACGAGC AGATCGACGA GCTCGACCAC CGGATCGAGG TGATGATCGC CCCTTTTCGG GAGACGGTCG ACCGGATCCG CACGATCACC GGGGTCGGTG AGGTCACCGC GACCGTGCTG CTCGCCGAGG TCGGCCTGGA CATGAGCCGG TTCCCCACCG CCGGCCATCT CGCGTCCTGG GCGGGTATCT GTCCGGGGAA CAACACCTCG GGAGGGAAAC GCCTGTCCGG GCGGACCCGA CACGGTAACA AGTGGTTACG TACCGCGTTG ACCGAGGCCG CGCACGCCGC CGCCCGGAGC AAGGACACCT ACCTGGCGTC CCACCACGCC CAGGTCCGTG GCCGCCGCGG TGTCCTGAAA GCGATCGGCG CGACCCGCCA CGACATTCTC ATCGCCTACT GGCACATCAT CGCGAACAAG ACCGTCTACC AGGACCTCGG CGGAGACTGG CATGCCCGCC GACGCCGGGA CCCTGAACGC CGCCGGAAGA ACCTCGTCGG CGAACTGGAG AAACTCGGCT ACACCGTCAC CATCACACCA GCGGCATAG
|
Protein sequence | MQVLFPRCAG LDVHRDTVAA AVRIQTGSGK AVTEVRTFTT TGGSLGLLAD WLTECRVTIV GMESTGVYWK PVFHLLEDRF ECWLLNATHV RNVPGRKTDV ADAAWISDLV AHGLVRASFV PPKPQRDLRD LTRARRIVVE EKTREIQRLE KLMQDAGVKL TSVASKLLGV SGRAILEKMI EGEQSLEYLA DQARGRLRSK IPQLQEARAG TFRSGHHGFL AAQLLARIDL CDEQIDELDH RIEVMIAPFR ETVDRIRTIT GVGEVTATVL LAEVGLDMSR FPTAGHLASW AGICPGNNTS GGKRLSGRTR HGNKWLRTAL TEAAHAAARS KDTYLASHHA QVRGRRGVLK AIGATRHDIL IAYWHIIANK TVYQDLGGDW HARRRRDPER RRKNLVGELE KLGYTVTITP AA
|
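The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch with hypothetical helper names: it strips the whitespace, computes a GC fraction, and checks for a terminal stop codon, assuming the standard stop codons of translation table 11 (TAA, TAG, TGA). Only the first and last blocks of the gene sequence are used here, for brevity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_terminal_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


head = clean_sequence("ATGCAGGTGT TATTCCCACG")   # first two blocks of the gene
tail = clean_sequence("CATCACACCA GCGGCATAG")    # last two blocks of the gene
print(head[:3], has_terminal_stop(tail))  # prints "ATG True"
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value to compare with the GC content row.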