Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0158 |
Symbol | |
ID | 3903089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 185247 |
End bp | 188675 |
Gene Length | 3429 bp |
Protein Length | 1142 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637877490 |
Product | hypothetical protein |
Protein accession | YP_479279 |
Protein GI | 86738879 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGACG ATCTTCTTGC CAGGGTCCAG GGGCGACTGG AGCTGTTCGC CGCCGAAAGC TCCCCCGAGA TCGTGTTGGC CAACGATGCG GTGTTGGAGG TGATGGGTCT CTTGAACGCC GTCCCGGATC CGTCCATCGA CACGGATGTT CTGCTCGCGG CCGGTCTTCT GTACTGGTGT CGTTATCTTG TGCTGGGAGG TGACGATGGC CGGCCGGACC TGGCGCGGGC ACGGGAACTG CTGGCATCGG TCCACCGGTC GAATCCGACC CTTCTACCCA GCGAGATCCG TGACGCCCTT GACGAGGCTG ACATGAACCC GAGCAGCCGG GAAAGGTTGG CACTTCGCGC CGAACACCTG TACGGCCAGT CGACGAAAAC CGGTGACGTC GACGGTCTCA GCGAGGCGAT CGCCCTGTTC CGACGCGCGT CTGCTGTTAC GCCCGTGAGT CATCCGCTTC ACGTCGGGCT CCTTTCCAAT CTCGCGACGG CGTTACAGAC GCGGTTCGCC TGGACGGGTT CGGACTCCGA CATCGACGAG GCGGTCGACC TCGGCCGGCT GGCCGCCACC CGGGCACCGG AGAACCACCC TTCCCGCTTC CTGGTGCTGT CGGGTCTCAG TGGTTCGCTA TGGGCGCGAT GTGTGCACCG GAAGTCGCCG GCGGACCTGG AGGAATCCCT GCAGGCAATC CGGCAGGCGG TCGCGGTCAT CCCGCCACAA GATCCTAATA GCGGCCGTTA TCTGTCAAAC CTAAGTAACA TTCTGCGTTC CCGGTTCGAG TGGACGGGCG CGCGAGCCGA TCTCGACGAG GCAGTGGAAC AGGGTCGACG CGCCGTCGAT GTGACGCCGG CCCACCATCC TCAGTACGCG ACGATGCTCA CCAATCTGGC CGTCGCGCTG CAGACGCGTT TCGTGCAGGC CGGGGTCTCG ACGGATCTAA CGGCCGCCAT CGATATCTTT GGCCGCGCGG CCACAGTAAC GCCTCCCAGC CATCCGCACT TTCCGGTCGT CTTGGGAAAC TTGAGTGCTG CGCTGCTGGT CCGGGCGCTG CACACCGGGA CGGACTCCGA CCTCACCACG GCGGTGGAGA CGGCCCGGCG GGCAGTGGCT GTGACTCCTC CCGGCAGCCC GGACCGTGCC CGGCGTCTCT CCAATCTGGG CAACATTCTC CGGGCACGTT TTGATCGGGT CGGGCTGCTG GTTGACCTGG ATGAGGCTGT GGAGGTCTGC CGGCAGGCGG TAGTGGCGAC GCCGGCCAGC CACGCGGAAC GGGCCGTGAT ACTGACCAAT CTTGGGGCCG TGGTGGGTCT GCGCGCGGAT CGAATCGGGC GCGCGGCCGA CCTGGACGAG GCAGTCACCG TCGGCCGGCA GGCGGCGGCT GCCACCTCGA CGGAGCACAC GGCTTGGGTC CCCGTCATGG TGAACCTCTG CAGGGCGCTG TCACGACGCG CCCGGCTGGC TGGTACCTCC GCAGATCTGG ACGAGGCGGT GGAAACTGCC CGCGCGGCGC TCGCCGCCGC CGAGGCCAAA GAGAACAGAG CCTTTGTCGC GGCCGCGGCG TCGAATCTCG GCGAGACCCT CCATCTCCGC TTTGACCGGA CAGAAGACAT GCCCGACCTG GACGGTTCCG TGGCGGCGTA CCGGATGGCG GTCGATGCTC GCGGCGATGA CCCGGATGCT GCCACTTCCC TGTCCGGCCT CGGTCTTAGT TTGTGGAACC GTTTCGAACA CACTGGGAGG CCGGCGGATC GGGATGTCAG TATCGCCGTG TTTCGACGTG CCGCCGCGCT GGCAACGGCA GCACCCAGCA TCCGAGCGAA GGCGGCGGGA GCATGGGCGA GCCTGGCGGC CACCGCGGGC GACTGGCAGC AGGCTGTCGC CGGCTACAGC ACCGCTGTGG ACCTGCTGGG ACAGGTCGCG CCGCGCAGCC TTGACCGCGA AGACCAGGAG TATCGTCTCG TCACCCTGTC CCGGCTGGGA TCGCAGGCCG CCGCAGCGTG TTTACAGGTC GGCAAGGTCG AACGCGCCCT GGAACTCTGG GAGCAAGGAC GCGGAGTCAT CCTCGGCCAG ATTCTCGACG CCCGTACGGA CCTCGCCCTC CTTGCCGCAA GAGACCCGGA GAAGGCCGCG CTGTTCAGGC GGCTTCAGGA TGAGTTCGAC GCTCCTCCCG CCTTCGACGG ATCCGACATA CCGCTGGCGG AGCAGGCATC CCCGCCATTC GTCGGAAACG ACGCGACGGG CACGGCACGA CGCGGTGCCG ACCGGCGGCA TGCCCGCGCG GCGCGGTTCG CAAGCCTCGT CTCCGAGATC CGCAGCCTGC CGGACTTCGA GCGTTTTCTT CTGCCACCAA CCATCGACGA TCTTCGGCAC GCCGCGAGCC AGGGGCCGAT CGTGGCCGTC AACGTCAGCG AGATCCGTTG CGATGCCCTG ATCCTGACCA CGGCCGGCGT GCAGCTCCTA CCCCTGCCGG ACCTCACTGA AGAAGCCGTT GGTGACCAGG TCCTCGCTTT CCTGACGGCT GTCGAACGCG GCGACGAGAA GGGGCTCTCG AACGTCTTCG GCTGGCTGTG GGACGTCCTC GCCGGACCGG TGTTGGAACG CCTGGACATC CACGGACCAC CGGCAACGGG CACCTCGTGG CCGCGGATGT GGTGGTGCCT GTCGGGGCTG TTGTCGTTCC TTCCCGTGCA CGCGGCAGGC CATCACCAGG CTCGATTCGA TCCGGCGCCG GACACGCTGA TCGATAGAGT GATCTGTTCG TACACTCCAA CGATCCGCGC CTTGGGCCAT GCTCGGCGCA CCGCGCCCGA CGCGGCGACG CTTGTCGGCC TTCCGTCAGC GAACGACGAG GGCAGGCGCG CGCTGGTCGT GGTGATGCCC CACACCCCCG ACGCCGGCGA CCTGCCCGGC GCACACCTCG AAGCCGCCAT CCTCACGCGG ATTCTGCACG AACGGGTGAG CACGCTGGTC CAGGACAAAG CGACCCGCGC TGCGGTGCTG GCCGCCCTGC CGCAGGCACG CTGGGTGCAC TTCGCCTGCC ATGGCGAGGC AGCCATCTCC GCCCCATCGA CCAGCCGTCT GCTACTACAC GACCAGCCCC TCACCGTTCT GGACGTCAAC CGCCTCAGGC TCACCGACGC CGAACTTGCA TACCTGTCCG CGTGCGAAAC CGCCCGCCCG GGCGGCGAGC TTTCCGACGA GGCGATGCAC CTCGCCTCCG CCTTCCAGCT CGCCGGCTAT CGGCATGTCA TCGCCACGCT GTGGCCCATC AACGACCAGA TCGCCGTCGA CCTCGCGGAA AATATCTACA GGTTCCTAGC CGACGGTAGC GACGTGGCCG CGGCCGTTCA CAACGCCACC CGTGCCCAGC GCAACTACGC GCCGCGATCC CCGTCACAGT GGGCGTCCCA CATCCACGTC GGCGCCTGA
|
Protein sequence | MRDDLLARVQ GRLELFAAES SPEIVLANDA VLEVMGLLNA VPDPSIDTDV LLAAGLLYWC RYLVLGGDDG RPDLARAREL LASVHRSNPT LLPSEIRDAL DEADMNPSSR ERLALRAEHL YGQSTKTGDV DGLSEAIALF RRASAVTPVS HPLHVGLLSN LATALQTRFA WTGSDSDIDE AVDLGRLAAT RAPENHPSRF LVLSGLSGSL WARCVHRKSP ADLEESLQAI RQAVAVIPPQ DPNSGRYLSN LSNILRSRFE WTGARADLDE AVEQGRRAVD VTPAHHPQYA TMLTNLAVAL QTRFVQAGVS TDLTAAIDIF GRAATVTPPS HPHFPVVLGN LSAALLVRAL HTGTDSDLTT AVETARRAVA VTPPGSPDRA RRLSNLGNIL RARFDRVGLL VDLDEAVEVC RQAVVATPAS HAERAVILTN LGAVVGLRAD RIGRAADLDE AVTVGRQAAA ATSTEHTAWV PVMVNLCRAL SRRARLAGTS ADLDEAVETA RAALAAAEAK ENRAFVAAAA SNLGETLHLR FDRTEDMPDL DGSVAAYRMA VDARGDDPDA ATSLSGLGLS LWNRFEHTGR PADRDVSIAV FRRAAALATA APSIRAKAAG AWASLAATAG DWQQAVAGYS TAVDLLGQVA PRSLDREDQE YRLVTLSRLG SQAAAACLQV GKVERALELW EQGRGVILGQ ILDARTDLAL LAARDPEKAA LFRRLQDEFD APPAFDGSDI PLAEQASPPF VGNDATGTAR RGADRRHARA ARFASLVSEI RSLPDFERFL LPPTIDDLRH AASQGPIVAV NVSEIRCDAL ILTTAGVQLL PLPDLTEEAV GDQVLAFLTA VERGDEKGLS NVFGWLWDVL AGPVLERLDI HGPPATGTSW PRMWWCLSGL LSFLPVHAAG HHQARFDPAP DTLIDRVICS YTPTIRALGH ARRTAPDAAT LVGLPSANDE GRRALVVVMP HTPDAGDLPG AHLEAAILTR ILHERVSTLV QDKATRAAVL AALPQARWVH FACHGEAAIS APSTSRLLLH DQPLTVLDVN RLRLTDAELA YLSACETARP GGELSDEAMH LASAFQLAGY RHVIATLWPI NDQIAVDLAE NIYRFLADGS DVAAAVHNAT RAQRNYAPRS PSQWASHIHV GA
|
| |