Gene Francci3_1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1524 
Symbol 
ID3904990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1824983 
End bp1826509 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content70% 
IMG OID637878861 
Producthypothetical protein 
Protein accessionYP_480629 
Protein GI86740229 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00211809 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGCCA CGCTAGCGCT GGGTCTGCTG CTGCAGGGGC TCGCCTTGGT CGTGGTGGTT 
CGCGGTCGGG GCCGGCACGC CGTGTCCAGT CTCGGGCTGA TGTTCGTCGG TGCCGCCATC
GTCTACCACG GGGTGACGGA GGTCCTGCAG GTCCTGGTGC CGTCCTACAG TGACAACCGG
CTGTTGACCA CGGAGTCCGA CGTCGCGGCC TACACCGTTC TGGTCGGGTT CTCCCTGCTC
GCGTTCGCTT TCGGTTATCG CTTCGCGACC CCGCGCACGC CGCCGTCCAC CGGTTTCCGC
CGGGAGGAAG TCTTGGACTT CTTCGACTGG CGGGTCCTGA CTCCGCTGGC CCTGGCTGCG
GTGGCCGTGA CAGCGGTCGG CAGGAACACC TCGCCCGGCC ACGCGGACCC CTCGACGGTC
ACCGGCAGTC CCTACCTCGT CAGCGGCTTC GCGACGCAGT TCCTGGTCGT CGGCCTCGCC
CTGGGAAGCT TCGCCGTGCT CGTGCGGACG AAGGGCCGCG GATTCCTCCC GGTGCTCGGC
GTCCAGTGTA CGTTGCACAC ACTGGCGGGG CAGCGGCTGC CGGTGGCGAT AGCCGCGGGC
GCGGTGATCT ACCTACTGTC CATCGTCGGG ATTCCCATAC GCCGACGGCA GCTCGTCTCG
GTCGTCGCGC TCGTGGTTCT GGCCTACGTC GTCATCTACG GGGCACGCGC GGACGCGGGC
CGGCAGGTGT TCGGGTACAG CGTGGGTCCC GGGCAGCGCC TGCAGGCCCT GGCCTCCGGT
CTGACCCATC TGGAAGGCGG GATAAACCCG GGCGAGGTGG GAGACCTCGG GGTCCGCCTG
GACGGGAACT CCTACCCGTC GATAATCCTG CGACGGCTCC GCGACGGTTC TCCGCCCATC
GGTCCGGTAA CCCTGTGGAA CGACGTCAAC ATCGCCGTCC CGCGTTTTCT CAATCCCGAC
AAACTGAATT CCGACCTGGA GTCGCGTTCC CTCAAGACGC GCCTGTCCGA CACCTACGGG
ATCACCAACG CGTTCGACCG CCTGCCCACC CAGCTCGGTG AGCTGCTGCC CATCGGCGGT
CCGCCGTGGA TGGTGACCCT CGCGGCTCTG GCAGGATTCG TCCTCGTCCG ACTGGAGTAC
GCGCTGCGCG AATGTCGCCA TCCAGCGGCT CTCCTGGGCC TGCTGGCGCT GGTGGCCGCG
ATCCTGCAGT ACGAAGGCGG GATCGCGCTC TACACAATCA ACGGCCGCGG TGTCCTGGCC
ATCGCGGCCG GACTGTTCGT GTGGCGGAAA CGACGGATCT TCCGGCCCGC GACGGTTCGG
GGGCTGACCA CCCTACCGCC GCTGTACACC GGCACCCCGG CCACCGTCGC GCGGGCCGGC
TCCGACGAGG ACGGCACACC GACCAACCCG TCGCAGCCCG GCCCGTCGCC GGGTGACCCG
GTGCGGACCG AGCCGGCGTC CGCTGGCAGG GACGAGGCCG CGGCGCAGGT CACGGCCTGC
GGCCACGGCC CACGCCGCAA CCGGTGA
 
Protein sequence
MSATLALGLL LQGLALVVVV RGRGRHAVSS LGLMFVGAAI VYHGVTEVLQ VLVPSYSDNR 
LLTTESDVAA YTVLVGFSLL AFAFGYRFAT PRTPPSTGFR REEVLDFFDW RVLTPLALAA
VAVTAVGRNT SPGHADPSTV TGSPYLVSGF ATQFLVVGLA LGSFAVLVRT KGRGFLPVLG
VQCTLHTLAG QRLPVAIAAG AVIYLLSIVG IPIRRRQLVS VVALVVLAYV VIYGARADAG
RQVFGYSVGP GQRLQALASG LTHLEGGINP GEVGDLGVRL DGNSYPSIIL RRLRDGSPPI
GPVTLWNDVN IAVPRFLNPD KLNSDLESRS LKTRLSDTYG ITNAFDRLPT QLGELLPIGG
PPWMVTLAAL AGFVLVRLEY ALRECRHPAA LLGLLALVAA ILQYEGGIAL YTINGRGVLA
IAAGLFVWRK RRIFRPATVR GLTTLPPLYT GTPATVARAG SDEDGTPTNP SQPGPSPGDP
VRTEPASAGR DEAAAQVTAC GHGPRRNR