Gene Francci3_1073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1073 
Symbol 
ID3906416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1278220 
End bp1279308 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content76% 
IMG OID637878407 
ProductNHL repeat-containing protein 
Protein accessionYP_480184 
Protein GI86739784 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0871328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGGATCG GCGCCCCGGC GCCCGGCGGG CTGGCCCTGC CGTCGGCGCG GCCGTCGGCG 
TCCCGGCTGT ACGCGCCGCG CGGGGTGTGG CTCGGCGACG ACCTGCTCGT GGTGGCGGAC
TCGGGTAACC ACCGGGTGCT GATCTGGCAT GGTCTGCCGG CCGTCGACGG CGCCCCCGCC
GACGTCGTGC TCGGCCAGGC CGACGCGACG AGCGAAGGGC CGGCGGCCGC GGGGCGCGGG
CCCGAACGCG GCCTGCACCT GCCGACCGGC GTGCTCGTCA CGGATGGCCG GCTCGTGGTC
GCCGATGCCT GGCACCACCG GGTGCTGGTC TGGAACGAGG TCCCGACGGT CACCGACACC
GCCCCCGACC TCGTGCTCGG GCAGCCGGAC GCCGACGCCG TCCGCGAGAA CCGGGGCGGC
CCGTGCGGAC CAGACACCTT CTACTGGCCC TTCGGGGTGG CCGTGGTGGG TGGGCGCTTC
TACGTCGCCG ACACCGGTAA CCGGCGGATC CTGGGCTGGT CGAATGGCCT GCCGTCCTCG
CCCGGCCGGC TACCCGACCT GGTGCTGGGG CAGCCCGATC CCACCCGCCG GGACGAGAAC
CGTGGCGGCG CGGCCGGTCC GGCGAGCTTC CGGTGGCCGC ACGACCTCGC CGGCACCGCC
GACCGGCTGC TGGTCGCGGA CGCCGGCAAC CACCGGCTGC TCGGCTGGGC CCCCCATCCC
GACGCCGACG GCGACGCCGA TCTGGTGCTC GGCCAGCCCG ATCTCGCATG CTCCGGGGAG
TTCCCCTACG CGCCCGGCCG GGCCGACGTC CTGCGCTTCC CCTACGCCGT CGACAGCTAC
GGGCACCTGC TGGCCGTCGC GGACACCGCC AACAACCGCG TCCTGCTGTG GGAGGAGCTG
CCGCGGCGCA GTTCCACGCC CGCGACCGGT GTGCTCGGCC AGCCGTCGTT CGCCGAGACC
GGGGAGAACC GGTGGACCCG CGTCGAGGCC GACACCTTCT GCTGGCCGTA CGGCCTCTCG
GTGCGCGGCG ACCGGCTCGC GGTGGCGGAC TCGGGCAACA ACCGGGTCAT GATCTGGCGG
CGGGTATGA
 
Protein sequence
MWIGAPAPGG LALPSARPSA SRLYAPRGVW LGDDLLVVAD SGNHRVLIWH GLPAVDGAPA 
DVVLGQADAT SEGPAAAGRG PERGLHLPTG VLVTDGRLVV ADAWHHRVLV WNEVPTVTDT
APDLVLGQPD ADAVRENRGG PCGPDTFYWP FGVAVVGGRF YVADTGNRRI LGWSNGLPSS
PGRLPDLVLG QPDPTRRDEN RGGAAGPASF RWPHDLAGTA DRLLVADAGN HRLLGWAPHP
DADGDADLVL GQPDLACSGE FPYAPGRADV LRFPYAVDSY GHLLAVADTA NNRVLLWEEL
PRRSSTPATG VLGQPSFAET GENRWTRVEA DTFCWPYGLS VRGDRLAVAD SGNNRVMIWR
RV