Gene Francci3_4152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4152 
Symbol 
ID3907117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4951639 
End bp4952730 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content72% 
IMG OID637881480 
Producthypothetical protein 
Protein accessionYP_483229 
Protein GI86742829 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR01643] YD repeat (two copies)
[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGAA CAAGCACGTC ACCCCGAGTT GCCGGGACGA CGCGGCGACG GCGAACCCGC 
CGCGTCCGCG GTTTTCGCGC GATCGGTGCT GCGGTGGTGG CGCTGGCCCT CCCGCTGGTC
AGCGCCGGAC CCTCGTCGGC CGGCACCGGC GGGTCGGTGA TCGACCTCGG TACCCTGCCC
GGCGGCAGCA GCAGCGGCGC CTACGACGTG AACAACAGCG CGGTCGTCGT CGGCTACTCG
GCGAGTGCCA GCGGCCGCAA CCACGCCGTC CGGTGGAACA GCCTGGGTGT ACCGACCGAC
CTCGGCACCC TGCCCGGGGA CGAGGCGAGC GTCGCCTTCA CCATCAACGA CGCTGGCACC
GTGACGGTTG GCGCCTCCCT GATTCCCGGT GGCCCCACCC ATCCGGTGGA ATGGGACGCG
GCTGGCCAGA TCACCGCGCT GACGACGCCT GCCGGGAGCA TTCTGAGCCG GGCCTATGCG
GTCAACAACC AGGGCACGGT CATCGGCTTC TGGAGTGGGC CGGACCGGTT GTACCACGCG
CTGCGGTGGA CCTCGGCCAG CACGCCCGTG GCCCTGCCCC AGCTACCGGG GGATACCGCC
AGCTCCGCGG GATGGATCAA CAACAGGGGT GTGATCGTCG GCTATTCGAA GACTGCAGCC
GGCGTTGCGC GGGCCGTCCG GTGGAACCCT GACGGAACCG TCTCCAGACT CGCCGACCTG
CCGGGCAGCG ACTCCAGTGA GGCGAGCGCC GTCAGCGACA CCGGCATCAT CGTCGGTCTC
GCCACCACGG GTGCGAGGTC GCACGCGGTC CGCTGGGACC ACGCCGGTGG GATCACCGAG
CTGCCGCCGC TGCCCGGCGA CACCGGTGCC GGTGCCTATG GCGTCAACGA GCGGGGGATC
GTCATCGGCT TCTTAAGCGC GAGCGACGGC ACCCGCAGCG CGGTGGCGTG GAGTCCGAGC
GGCCAGGTCA CCCGGCTGCC GACCGCCACC GCCGGGCCGG CGGAGGCGTA CGGCGTGAAC
GACCGAGGCG CGGTCGCGGG GTCCTCCACC GCGGCCGACG GATCGGCTCA CGCAACACTC
TGGCTGGCCT GA
 
Protein sequence
MTGTSTSPRV AGTTRRRRTR RVRGFRAIGA AVVALALPLV SAGPSSAGTG GSVIDLGTLP 
GGSSSGAYDV NNSAVVVGYS ASASGRNHAV RWNSLGVPTD LGTLPGDEAS VAFTINDAGT
VTVGASLIPG GPTHPVEWDA AGQITALTTP AGSILSRAYA VNNQGTVIGF WSGPDRLYHA
LRWTSASTPV ALPQLPGDTA SSAGWINNRG VIVGYSKTAA GVARAVRWNP DGTVSRLADL
PGSDSSEASA VSDTGIIVGL ATTGARSHAV RWDHAGGITE LPPLPGDTGA GAYGVNERGI
VIGFLSASDG TRSAVAWSPS GQVTRLPTAT AGPAEAYGVN DRGAVAGSST AADGSAHATL
WLA