Gene Francci3_2669 details


Gene Information

Locus tag: Francci3_2669
Symbol:
ID: 3904893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3150887
End bp: 3152065
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 637879994
Product: integral membrane protein
Protein accession: YP_481760
Protein GI: 86741360
COG category: [S] Function unknown
COG ID: [COG5563] Predicted integral membrane proteins containing uncharacterized repeats
TIGRFAM ID: [TIGR02913] probable extracellular repeat, HAF family
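
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 3150887
END_BP = 3152065


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1179 392
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.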


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.652758
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.165313
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGATC GAAGGACATT CATCCGTGCC GCCACGGTGG CCACGACCCT CGCGCCCTGG 
TCGCTCGGCT ACCTCGGCGC GGCCACCGCG GCGGGCGCGG CCACCGCGGC GCCGTCCTCC
CCACCCGCGC CCACCCGACC CGCCGTCGTG GACATTCCAT CCCCGACCGG GGCCTCCGGT
CTCCTCACCG GCGGCAACAC CGCAGGCCTT TTCGCGGGCT ACTGGCAGAC CTCATCCTCC
GAGCTCAGTG GGTTCGTCTA CCACGGCGGT ACGGTGCGCG GCCTGCCGGG GAACAGCCGG
CCGGCCGCGG TGAGCGAGGC CGGCGTGGTC GTCGGCGAGA ACACCACGCG TTACAGCCGG
GAGGCGTTCC GCTGGAACCG TGGCGTCTAT CAGTCGCTGG GATTCCTCGG CGGCACCCCC
AGTCAGGGCG GACAGTCCAG TACCGCGGTC GACGTCAGCG ACACCGGATT CATCGTGGGG
ACCAGCACCA CCAATACGGG CGAGCAGCAC GCCTACCGGT GGTCGAACGG CACGATGACG
GATCTGGGCA CGCTTGGCGG ACCGTTGAGT TCCGCCGTGG CCGTCACGGT GCAGGGAAGG
GTGGTGGGTA ACAGCCTCAC CCGGGAGGGC GCCAGCCACG GGTTTCTCTG GTCGGCCGGC
ACAATATCGG ATCTCGGGAC CCTCGGCGGT TCCTCGACCG TCGTCGCCGA CGTGAACAAC
GCCAGGCAGA TCGTGGGGAC CAGTGAGACC TCGGACGGCC ATTCCCGCGC GTTCCTGTGG
GACGGCGGCC GCATGACCGA TCTCGGGACC CTCGGCAGTG ACCTGCACAG CGAGGCGGTC
GCCGTGAACA GGCTCGGTCA GGTGTTGGTG CGCAGCCTCG GATCGTCGGG TGGCGGGTTC
CTGTGGATCG CGGGTCGCCG CATCCCGATC ACCTCACCGC TCGGCCCGCT GGAGCTTCTC
GGCCTCAATG ATCACGGTGT GGTGTGCGGG ACGGTGCCCT CGGGTCAGGG CAGCCACGCG
TTCCGCTGGT ACCGCGGCCG GCTGACCGAT CTGGGCACCC TCGGAGGACC GGTGAGCACG
GGCAACGCCG CCACCCCGAA CAACGTCGTG CTCGGTTCCG CGACCACCAC GGACTCCCCG
TTTCCCCATG CGGCGTTCTG GCCGAACACC GGCGCCTGA
 
Protein sequence
MNDRRTFIRA ATVATTLAPW SLGYLGAATA AGAATAAPSS PPAPTRPAVV DIPSPTGASG 
LLTGGNTAGL FAGYWQTSSS ELSGFVYHGG TVRGLPGNSR PAAVSEAGVV VGENTTRYSR
EAFRWNRGVY QSLGFLGGTP SQGGQSSTAV DVSDTGFIVG TSTTNTGEQH AYRWSNGTMT
DLGTLGGPLS SAVAVTVQGR VVGNSLTREG ASHGFLWSAG TISDLGTLGG SSTVVADVNN
ARQIVGTSET SDGHSRAFLW DGGRMTDLGT LGSDLHSEAV AVNRLGQVLV RSLGSSGGGF
LWIAGRRIPI TSPLGPLELL GLNDHGVVCG TVPSGQGSHA FRWYRGRLTD LGTLGGPVST
GNAATPNNVV LGSATTTDSP FPHAAFWPNT GA
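
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal Python sketch, not part of the original record. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons), and it uses only the first and last lines of the gene sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "ATGAACGATC GAAGGACATT CATCCGTGCC GCCACGGTGG CCACGACCCT CGCGCCCTGG"
LAST_LINE = "TTTCCCCATG CGGCGTTCTG GCCGAACACC GGCGCCTGA"

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


print(translate(FIRST_LINE))  # MNDRRTFIRAATVATTLAPW
print(translate(LAST_LINE))   # FPHAAFWPNTGA
```

The two translations match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field. The last line stays in frame because the 19 preceding lines hold 60 bases each, a multiple of 3.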