Gene Francci3_3074 details

Gene Information

Locus tag: Francci3_3074
Symbol:
ID: 3904275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3641603
End bp: 3642838
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 74%
IMG OID: 637880395
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_482160
Protein GI: 86741760
COG category: [C] Energy production and conversion
COG ID: [COG1251] NAD(P)H-nitrite reductase
TIGRFAM ID:
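
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that adds no amino acid. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3641603
end_bp = 3642838
gene_length_bp = 1236
protein_length_aa = 411

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is read in triplets and the final codon is a
# stop codon, which does not contribute an amino acid.
codons = span_bp // 3
expected_aa = codons - 1

print(span_bp, codons, expected_aa)  # 1236 412 411
```

Under these assumptions the span matches the listed gene length, and the codon count minus one matches the listed protein length.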


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGAA TAGTCGTCGT GGGCGCGGGA ATGGCGGGGG GCCGGGCCGC CGCCGAGGTG 
CGCGCGCGTG GTTTCGCGGG AGAGGTGATC CTGCTCGGGG CGGAGCCGCA CCGGCCCTAC
GATCGTCCGC CGCTGTCCAA GGCGGTGCTC TCCGGCCGCA TCGACGACAC GACGTTGCCC
TTCGACCTCG ACGGGGTCGA GGTGCGCCTC GGACGGCGGG CGACGGGCCT GCGACCGGGC
ACTGTGGAGA CCGACGCGGG GCCGCTGGAC TACGACGGCC TGGTGCTGGC GACGGGAGCC
GACCCGGTAA CCCTGCCGGG CGACGGCCCA CAGCGGGTCC TGCGTTCCAT TGACGACGCC
CGGGATCTGC GTGGCGCCCT GCGTCCCGGC GTCCGGCTCG TCATCGTTGG TGCCGGCTGG
ATCGGCGCGG AGGTCGCCAC GGCGGCGGCA GCGATCGGTG CCGAGGTCAC CGTCGTGGAG
GCAGCCGCCT CGCCGCTGTT CGCCGCGCTC GGCGCGCAGG TGGGCCGGTG CACGATCGCG
TGGTACGCCC GGGCCGGGGT TACCCTGCGC CTGGCCACGG CCGTGGAGCG GGTCACCCCC
GACGGGCTGG TGCTGGCTGG CGGTGAGGAG CTGATTGCGG ACGAGGTTGT CGTCGGCGTC
GGGGTGCGGC CCGGCACCGC CTGGCTCGCC GGTTCCGGGC TCGCCCTCGA CCGGGGAGTC
GTGGTTGACG AGCATCTCGC CGCCCGGTGG ACCCATGGTG CCGACGGTGC CGACGGCGAC
CGGCCGCCGG TGGTCGCGGT CGGTGACTGT GCTGCTTGGT GGTCGCGTCG CTACGGTCGG
CGGCTGCGGG TCGAGCACTG GGACTGCGCC CAGCAGTCGC CCGCGGTCGC GGTCAGCACG
CTGCTCGGCG GGGAGGCCGT CTATGACCCG GTGCCCTACT TCTGGTCGGA ACAGTTCGGC
CGCATGGTGC AGTTCGCGGG ACTCCCTTCG GCAGAGGCTG CCCCGGTGTT CCGGGGCGAT
CCGGGCGTCG TCGTGCCGCC CGACGGCGGG CGTCCGTCTG GCTGGTCGGC CGGGTGGTTC
GCCCCGGACG GCCGTCTCGA GGCGCTGGTC ACTGTGGGGC GCCCGATGGA CATGGTCGCC
GGTCGCCGGT TGATGGCGGC CGACGGTATC CCGGATCGGG AGCGTTTCGC CGATATCTCC
GTGTCGATGA AGGAACTGGC CGCGGCAAGT CGATGA
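
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be stripped before computing a length or a GC content. A minimal sketch of one way to do that follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample input is only the first line of the sequence above, so its GC value will not necessarily equal the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as sample input.
sample = clean_sequence(
    "ATGCAGCGAA TAGTCGTCGT GGGCGCGGGA ATGGCGGGGG GCCGGGCCGC CGCCGAGGTG"
)
print(len(sample), round(100 * gc_fraction(sample)))
```

Passing the full 21-line sequence through the same two functions gives the values to compare against the Gene Length and GC content fields.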
 
Protein sequence
MQRIVVVGAG MAGGRAAAEV RARGFAGEVI LLGAEPHRPY DRPPLSKAVL SGRIDDTTLP 
FDLDGVEVRL GRRATGLRPG TVETDAGPLD YDGLVLATGA DPVTLPGDGP QRVLRSIDDA
RDLRGALRPG VRLVIVGAGW IGAEVATAAA AIGAEVTVVE AAASPLFAAL GAQVGRCTIA
WYARAGVTLR LATAVERVTP DGLVLAGGEE LIADEVVVGV GVRPGTAWLA GSGLALDRGV
VVDEHLAARW THGADGADGD RPPVVAVGDC AAWWSRRYGR RLRVEHWDCA QQSPAVAVST
LLGGEAVYDP VPYFWSEQFG RMVQFAGLPS AEAAPVFRGD PGVVVPPDGG RPSGWSAGWF
APDGRLEALV TVGRPMDMVA GRRLMAADGI PDRERFADIS VSMKELAAAS R
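
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a plain-Python illustration, not the tool that produced this record. It relies on the amino acid assignments of translation table 11 being the same as those of the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 36 bases of the gene sequence above.
first = translate("ATGCAGCGAA TAGTCGTCGT GGGCGCGGGA")
last = translate("GTGTCGATGA AGGAACTGGC CGCGGCAAGT CGATGA")
print(first)  # MQRIVVVGAG
print(last)   # VSMKELAAASR*
```

The two fragments agree with the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TGA stop codon.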