Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2669 |
Symbol | |
ID | 3904893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3150887 |
End bp | 3152065 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879994 |
Product | integral membrane protein |
Protein accession | YP_481760 |
Protein GI | 86741360 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.652758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.165313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATC GAAGGACATT CATCCGTGCC GCCACGGTGG CCACGACCCT CGCGCCCTGG TCGCTCGGCT ACCTCGGCGC GGCCACCGCG GCGGGCGCGG CCACCGCGGC GCCGTCCTCC CCACCCGCGC CCACCCGACC CGCCGTCGTG GACATTCCAT CCCCGACCGG GGCCTCCGGT CTCCTCACCG GCGGCAACAC CGCAGGCCTT TTCGCGGGCT ACTGGCAGAC CTCATCCTCC GAGCTCAGTG GGTTCGTCTA CCACGGCGGT ACGGTGCGCG GCCTGCCGGG GAACAGCCGG CCGGCCGCGG TGAGCGAGGC CGGCGTGGTC GTCGGCGAGA ACACCACGCG TTACAGCCGG GAGGCGTTCC GCTGGAACCG TGGCGTCTAT CAGTCGCTGG GATTCCTCGG CGGCACCCCC AGTCAGGGCG GACAGTCCAG TACCGCGGTC GACGTCAGCG ACACCGGATT CATCGTGGGG ACCAGCACCA CCAATACGGG CGAGCAGCAC GCCTACCGGT GGTCGAACGG CACGATGACG GATCTGGGCA CGCTTGGCGG ACCGTTGAGT TCCGCCGTGG CCGTCACGGT GCAGGGAAGG GTGGTGGGTA ACAGCCTCAC CCGGGAGGGC GCCAGCCACG GGTTTCTCTG GTCGGCCGGC ACAATATCGG ATCTCGGGAC CCTCGGCGGT TCCTCGACCG TCGTCGCCGA CGTGAACAAC GCCAGGCAGA TCGTGGGGAC CAGTGAGACC TCGGACGGCC ATTCCCGCGC GTTCCTGTGG GACGGCGGCC GCATGACCGA TCTCGGGACC CTCGGCAGTG ACCTGCACAG CGAGGCGGTC GCCGTGAACA GGCTCGGTCA GGTGTTGGTG CGCAGCCTCG GATCGTCGGG TGGCGGGTTC CTGTGGATCG CGGGTCGCCG CATCCCGATC ACCTCACCGC TCGGCCCGCT GGAGCTTCTC GGCCTCAATG ATCACGGTGT GGTGTGCGGG ACGGTGCCCT CGGGTCAGGG CAGCCACGCG TTCCGCTGGT ACCGCGGCCG GCTGACCGAT CTGGGCACCC TCGGAGGACC GGTGAGCACG GGCAACGCCG CCACCCCGAA CAACGTCGTG CTCGGTTCCG CGACCACCAC GGACTCCCCG TTTCCCCATG CGGCGTTCTG GCCGAACACC GGCGCCTGA
|
Protein sequence | MNDRRTFIRA ATVATTLAPW SLGYLGAATA AGAATAAPSS PPAPTRPAVV DIPSPTGASG LLTGGNTAGL FAGYWQTSSS ELSGFVYHGG TVRGLPGNSR PAAVSEAGVV VGENTTRYSR EAFRWNRGVY QSLGFLGGTP SQGGQSSTAV DVSDTGFIVG TSTTNTGEQH AYRWSNGTMT DLGTLGGPLS SAVAVTVQGR VVGNSLTREG ASHGFLWSAG TISDLGTLGG SSTVVADVNN ARQIVGTSET SDGHSRAFLW DGGRMTDLGT LGSDLHSEAV AVNRLGQVLV RSLGSSGGGF LWIAGRRIPI TSPLGPLELL GLNDHGVVCG TVPSGQGSHA FRWYRGRLTD LGTLGGPVST GNAATPNNVV LGSATTTDSP FPHAAFWPNT GA
|
| |